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TITLE OF INVENTION 
RECOMBINANT HIGH MOLECULAR WEIGHT MAJOR OUTER MEMBRANE 

PROTEIN OF MORAXELLA 

FIELD OF INVENTION 
5 The present invention relates to the field of 

immunology and is particularly concerned with outer 
membrane proteins from Moraxella, methods of 
recombinant production thereof, genes encoding such 
proteins and uses thereof. 

10 BACKGROUND OF THE INVENTION 

Otitis media is the most common illness of early 
childhood with approximately 70% of all children 
suffering at least one bout of otitis media before the 
age of seven. Chronic otitis media can lead to hearing, 

15 speech and cognitive impairment in children. It is 
caused by bacterial infection with Streptococcus 
pneumoniae (approximately 50%) , non-typable Haemophilus 
influenzae (approximately 3 0%) and Moraxella 
(Branhamella) catarrhalis (approximately 20%) . In the 

2 0 United States alone, treatment of otitis media costs 
between one and two billion dollars per year for 
antibiotics and surgical procedures, such as 
tonsillectomies, adenoidectomies and insertion of 
tympanostomy tubes. Because otitis media occurs at a 

2 5 time in life when language skills are developing at a 

rapid pace, developmental disabilities specifically 
related to learning and auditory perception have been 
documented in youngsters with frequent otitis media. 

M. catarrhalis mainly colonizes the respiratory 

3 0 tract and is predominantly a mucosal pathogen. Studies 

using cultures of middle ear fluid obtained by 
tympanocentesis have shown that M. catarrhalis causes 
approximately 20% of cases of otitis media (ref. 1 - 
Throughout this application, various references are 
35 referred to in parenthesis to more fully describe the 
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state of the art to which this invention pertains. Full 
bibliographic information for each citation is found at 
the end of the specification, immediately preceding the 
claims. The disclosures of these references are hereby 
5 incorporated by reference into the present disclosure) . 

The incidence of otitis media caused by M. 
catarrhalis is increasing. As ways of preventing otitis 
media caused by pneumococcus and non-typable H. 
influenzae are developed, the relative importance of M. 
10 catarrhal is as a cause of otitis media can be expected 
to further increase . 

M. catarrhalis is also an important cause of lower 
respiratory tract infections in adults, particularly in 
the setting of chronic bronchitis and emphysema (refs. 
15 2, 3, 4, 5, 6, 7, and 8) . M. catarrhalis also causes 
sinusitis in children and adults (refs. 9, 10. 11, 12, 
and 13) and occasionally causes invasive disease (refs. 
14, 15, 16, 17, 18, and 19) . 

Like other Gram- negative bacteria, the outer 

2 0 membrane of M. catarrhalis consists of phospholipids, 

lipopolysaccharide (LPS) , and outer membrane proteins 
(OMPs) . Eight of the M. catarrhalis OMPs have been 
identified as major components. These are designated by 
letters A to H, beginning with OMP A which has a 
25 molecular mass of 98 kDa to OMP H which has a molecular 
mass of 21 kDa (ref . 20) . 

Recently, Klingman and Murphy purified and 
characterized a high molecular-weight outer membrane 
protein of M. catarrhalis (ref. 21). The apparent 

3 0 molecular mass of this protein varies from 350 kDa to 

72 0 kDa as judged by sodium dodecyl sulfate - 
polyacryl amide gel electrophoresis (SDS-PAGE) . This 
protein appears to be an oligomer of much smaller 
proteins or subunits thereof of molecular mass about 12 0 
3 5 to 14 0 kDa and is antigenically conserved among strains 
of Moraxella. 
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Helminen et al also identified a protein of 
molecular mass of about 3 00 to 4 00 kDa, named UspA, that 
was reported to be present on the surface of Moraxella 
(ref . 22) . 

5 In WO 96/34960 and US Patent No. 5,808,024, 

assigned to the assignee hereof and the disclosure of 
which is incorporated herein by reference, there is 
described a new protein of M. catarrhalis which had an 
apparent molecular mass of about 2 00 kDa. Western blot 

10 analysis using antiserum raised against the 200 kDa 
protein suggested that this protein was different from 
the large UspA protein (> 3 00 kDa) , reported by the two 
groups in ref s . 21 and 22. Recently, the gene sequences 
encoding two related proteins, UspAl and UspA2 , have 

15 been published (ref. 23) . A sequence comparison between 
the two genes encoding the UspA proteins and the gene 
encoding the 2 00 kDa protein confirmed that the 2 00 kDa 
protein is different from either of the UspAl and UspA2 
proteins . 

20 Fitzgerald et al (ref. 29) have identified a 200 

kDa protein associated with haemagglutination. 
Transmission electron microcopy studies (ref. 30) showed 
that the 200 kDa protein associated with 
haemagglutination is present on the outer fibrillar 

2 5 layer of M. catarrhalis . Recently, a non-clumping 

variant of strain 4223 was prepared by serial passaging 
and it was observed that the non- clumping variant had 
reduced expression of both UspA and a 2 00 kDa protein 
that is not UspA (ref. 31) . It is possible that this 200 

3 0 kDa protein is the same as that described in WO 96/34 960 

and herein. 

The 200 kDa protein described herein has been 
detected in most, but not all, strains of Moraxella 
catarrhalis, which have been isolated from various 
35 sources, including otitis media (OM) , sputum, 
nasopharynx, expectorate and bronchial secretions. Table 
1A below contains a listing of M. catarrhalis strains 
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tested, their source and whether or not the 2 00 kDa 
protein is expressed. 

Af. ca.ta.rrha.lis infection may lead to serious 
disease. It would be advantageous to provide recombinant 
5 means for providing large quantities of 2 00 kDa outer 
membrane protein of M. catarrhal! s strains and genes 
encoding such proteins from various AT. catarrhal is 
strains for use as antigens in immunogenic preparations 
including vaccines, carriers for other antigens and 
10 immunogens and the generation of diagnostic reagents. 

SUMMARY OF THE INVENTION 
The present invention is directed towards the 
provision of a recombinantly-produced purified and 
isolated outer membrane protein of Moraxella catarrhalis 
15 and other Moraxella strains, having an apparent 
molecular mass of about 200 kDa, as well as genes 
encoding the same from various strains of Moraxella 
ca tarrhalis . 

In one aspect of the present invention, there is 
2 0 provided an isolated and purified nucleic acid molecule 
having (a) a nucleotide sequence set forth in Figure 3, 

4 or 5 (SEQ ID Nos : 5, 6, 8, 9, 11, 12) for Moraxella 
catarrhalis strains 4223, Q8 and LES-1 respectively or 
the complementary sequence thereto; (b) a nucleotide 

2 5 sequence encoding an about 2 00 kDa outer membrane 
protein of a strain of Moraxella catarrhalis and having 
the derived amino acid sequence shown in Figures 3 , 4 or 

5 (SEQ ID Nos: 7, 10, 13) for Moraxella catarrhalis 
strains 4223, Q8 and LES-1 respectively; and (c) a 

30 nucleotide sequence encoding an about 200 kDa outer 
membrane protein of another strain of Moraxella 
catarrhalis which is characterized by a tract of 
consecutive G nucleotides which is 3 or a multiple 
thereof in length, an ATG start codon about 8 0 to 9 0 bp 

35 upstream of said tract and said tract being located 
between about amino acids 2 5 and 3 5 encoded by the 
nucleotide sequence . 
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The another strain of Moraxella catarrhal is in (c) 
is a strain as identified in Table 1A other than strains 
4223, Q8 and LES-1 and expressing an about 2 00 kDa 
protein . 

5 in another aspect of the invention, there is 

provided (a) a nucleotide sequence set forth in 
Figure 8 (SEQ ID No: 12) for a 5 1 -truncation of the gene 
encoding an about 2 00 kDa outer membrane protein of 
Moraxella catarrhalis strain 4223; (b) a nucleotide 

10 sequence encoding the derived amino acid sequence set 
forth in Figure 9 (SEQ ID No: 13) for a N- terminal 
truncation of an about 2 00 kDa outer membrane protein of 
Moraxella catarrhalis strain 4223; and (c) a nucleotide 
sequence encoding a 5 1 -truncation of a gene encoding an 

15 about 2 00 kDa outer membrane protein of another strain 
of Moraxella catarrhalis and being capable of expressing 
the corresponding N- terminally truncated about 20 0 kDa 
outer membrane protein from E, coli. 

A further aspect of the invention providing an 

2 0 isolated and purified nucleic acid molecule which is a 

contiguous Nde I - Pst I fragment of SEQ ID No : 5. 

The invention, in an additional aspect, provides a 
vector for transforming a host comprising a nucleic acid 
molecule as provided herein, which may be a plasmid 
25 vector. The plasmid vector may be one which has the 
identifying characteristics of pKS348 (ATCC 203,529) or 
pKS294 (ATCC 203,528). The plasmid vector also may be 
one having the identifying characteristics of pQWE or 
pQWF. 

3 0 A further aspect of the invention provides a host 

cell, such as E. coli, transformed by a vector provided 
herein and expressing an about 200 kDa protein of a 
strain of Moraxella catarrhalis or an approximately C- 
terminal half thereof. The invention further provides, 
3 5 in an additional aspect, a recombinant about 2 00 kDa 
outer membrane protein of a strain of Moraxella 



catarrhalis or an approximately C-terminal half thereof 
producible by the transformed host provided herein. 

The recombinant about 2 00 kDa outer membrane 
protein or an approximately C-terminal half thereof may 
5 be formulated into an immunogenic composition, which may 
be formulated as a vaccine for in vivo administration to 
protect against disease caused by Moraxella ca.tarrha.lis , 
which may be provided in combination with a targeting 
molecule for delivery to specific cells of the immune 
10 system, formulated as a microparticle, capsule or 
liposome preparation, and may further comprise an 
ad j uvant . 

The invention, in a further aspect, includes a 

method of inducing protection against disease caused by 
15 Moraxella catarrhalis by administering to a susceptible 

host, which may be a human, an effective amount of the 

immunogenic composition provided herein. 

In an additional aspect, the invention provides a 

method for the production of an about 2 00 kDa outer 
2 0 membrane protein of a strain of Moraxella catarrhalis or 

an approximately C-terminal half thereof, which 

comprises : 

transforming a host cell, such as E. coli, with a 
vector as provided herein, 
25 growing the host cell to express the encoded about 

200 kDa protein or an approximately C-terminal half 
thereof, and 

isolating and purifying the expressed about 2 00 kDa 
protein or an approximately C-terminal half thereof. 
3 ° The encoded about 200 kDa protein may be expressed 

in inclusion bodies. The isolation and purification of 
the about 2 00 kDa protein may be effected by: 

disrupting the grown transformed cells to produce 
supernatant and the inclusion bodies, 
35 solubilizing the inclusion bodies to produce a 

solution of the recombinant about 2 00 kDa protein, 
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chromotographically purifying the solution of 
recombinant about 2 00 kDa protein free from 
contaminating proteins, and 

isolating the purified recombinant about 200 kDa 
5 protein. 

BRIEF DESCRIPTION OF DRAWINGS 
Figure 1 shows restriction maps of subclones of a 
gene encoding the 200 kDa outer membrane protein of AT. 
catarrhalis from A,EMBL3 clone 811 and the location of 

10 PCR primers used to amplify the 5 1 -region of the gene. 
The open reading frame of the about 2 00 kDa outer 
membrane protein is indicated by the shaded box. The 
numbers in parenthesis are approximate sizes of DNA 
inserts in plasmids . Restrictions sites are Sal: Sail, 

15 N: Ncol, B: Eg 111 r K: Kpnl , Xb : Xbal , Xh : Xhol , RV: 
.EcoRV; 

Figure 2 shows the nucleotide sequence (SEQ ID No: 
1 - entire sequence, SEQ ID No : 2 - coding sequence) of 
the gene encoding the about 2 00 kDa outer membrane 

20 protein of M. catarrhalis strain 4223, as determined 
from XEMBL3 clone 811, and deduced amino acid sequence 
(SEQ ID No: 3 - identified GTG start codon, SEQ ID No: 4 
- putative ATG start codon shaded) of the about 20 0 kDa 
outer membrane protein. A ten-G nucleotide segment of 

25 the 5'-UTR is identified by underlining. An ATG start 
codon for the same sequence but with a nine-G nucleotide 
segment is identified by a shaded box (see Figure 3) ; 

Figure 3 shows the nucleotide sequence (SEQ ID No: 
5 - entire sequence, SEQ ID No : 6 - coding sequence) of 

3 0 the gene encoding the about 200 kDa outer membrane 
protein of M. catarrhalis strain 4223, as determined 
from PCR-amplif ied genomic DNA of strain 4223 and the 
deduced amino acid sequence (SEQ ID No: 7) of the 
corresponding about 200 kDa outer membrane protein. A 

35 nine-G nucleotide segment of the sequence corresponding 
to the 10 -G nucleotide segment of Figure 2, is 
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identified by underlining. The GTG start codon 
identified in Figure 2 is identified by a light box; 

Figure 4 shows the nucleotide sequence (SEQ ID No: 
8) of the gene encoding the about 2 00 kDa outer membrane 
5 protein of AT. catarrhalis strain Q8 and the deduced 
amino acid sequence (SEQ ID No: 9) of the corresponding 
about 2 00 kDa outer membrane protein. A nine-G 
nucleotide segment is identified by underlining; 

Figure 5 shows the nucleotide sequence (SEQ ID No: 

10 10) of the gene encoding the about 2 00 kDa outer 
membrane protein of M. catarrhal is strain LES-I and the 
deduced amino acid sequence (SEQ ID No: 11) of the 
corresponding about 2 00 kDa outer membrane protein. A 
three-G nucleotide segment is identified by underlining; 

i5 Figure 6 contains an alignment of the amino acid 

sequence (in single letter code) of the about 200 kDa 
proteins of AT. catarrhalis strain 4223 (SEQ ID No: 7) , 
Q8 (SEQ ID No: 9) and LES-I (SEQ ID No: 11). The 
alignments of the sequences were made using BLAST and 

20 manual methods and are compared to the 4223 sequence. 
Gaps in the sequence where no corresponding or related 
amino acid exists are designated by " - " while identical 
amino acids are designed by " . " ; 

Figure 7 shows the restriction sites of the AT. 

25 catarrhalis strain 4223 derived 200 kDa protein gene as 
well as the identity of various plasmids containing 
partial or full length 200 kDa genes; 

Figure 8 shows the nucleotide sequence (SEQ ID No: 
12) and deduced amino acid sequence (SEQ ID No: 13) of 

30 the 5' -truncated gene encoding the M56 200 kDa protein 
of M. catarrhalis strain 4223 contained in pKS348; 

Figures 9A and 9B contain a schematic of the 
procedure for producing plasmid pKS2 94 expressing the 
full length 200 kDa protein of AT. catarrhalis strain 

35 4223; 
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Figure 10 is a schematic of the procedure for 
producing plasmid pKS348 expressing the N-truncated M56 
r200 kDa protein of M. catarrhalis strain 4223; 

Figure 11 shows a schematic procedure for the 
purification of recombinantly-produced 200 kDa protein 
from E. coli; 

Figure 12 shows SDS-PAGE analysis of the expression 
of M56 r200 kDa protein gene from E. coli. M. 
catarrhalis strain 4223 lysate was run as a positive 
control (a) and uninduced KS358 cultured overnight was 
run as a negative control (b) . In each lane, 20 of 
total protein was loaded; 

Figure 13 shows the SDS-PAGE analysis of the 
purification of the M56 r200 kDa protein according to 
the scheme of Figure 11. Lane 1, E. coli whole cells; 
Lane 2, soluble proteins after 50 mM Tris/NaCl, pH8, 
extraction; Lane 3, soluble proteins after Tris/Triton 
X-100/EDTA extraction; Lane 4, soluble proteins after 
Tris/OG extraction; Lane 5, pellet after Tris/OG 
extraction; Lanes 6, 1 , purified 200 kDa proteins- 
Figure 14 shows the anti-M56 r2 00 kDa protein 
antibody titers obtained in mice. Mice were immunized on 
day 1, day 29 and day 43 with 0.3 ^g, 1 j^g, 3 jj,g or 10 
|Lig of the purified M56 r2 0 0 kDa protein in adjuvant. 
Antisera were obtained on days 14, 28, 42 and 56 and 
anti-M56 r200 kDa protein IgG titers were determined. 
The reactive titers of antisera were defined as the 
reciprocal of the dilution consistently showing a two- 
fold increasing in absorbance over that obtained with 
the pre-bleed serum sample collected on day 0; 

Figure 15 shows the anti-M56 r2 00 kDa antibody 
titers in guinea pigs . Guinea pigs were immunized and 
antisera were analyzed according to the protocol of 
Figure 14 ; 

Figure 16 shows the location of PCR primers used to 
amplify a DNA fragments carrying portions of the 200 kDa 
protein gene from chromosomal DNA of M. catarrhalis 
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strain RH408, a spontaneous mutant of strain 4223 which 
does not produce the 2 00 kDa protein; 

Figure 17 is a partial nucleotide and derived amino 
acid sequence for the 200 kDa protein of M. catarrhalis 
strain 4223, indicating by arrows the locations of the 
initial amino acid of the respective three truncations 
ALA 12 , VAL 19 and GLY 39 ; 

Figure 18 shows schematic diagrams for two 3 1 half 
clones of the 4223 200 kDa gene. Clone pQWE contains a 
fusion between the 5 r end of the 200 kDa gene and the 3' 
half of the gene. Clone pQWE contains the 3' half of 
the gene alone. The location of the PCR primers used to 
generate pQWF is indicated. 

Figure 19 is a construction diagram for producing 
plasmid pQWE expressing a C-terminal portion of the 200 
kDa protein of M. catarrhalis strain 4223 fused to the 
N- terminus ; and 

Figure 2 0 is a construction diagram for producing 
plasmid pQWF expressing a C-terminal portion of the 200 
kDa protein of M. catarrhalis strain 4223. 

GENERAL DESCRIPTION OF THE INVENTION 

In WO 96/34960 (Figure 6) , the sequence of a cloned 
gene from M. catarrhalis 4223 encoding an about 200 kDa 
protein, was described. The open reading frame was 
predicted to start at a GTG codon. Sequence analysis of 
2 00 kDa genes from additional strains, suggested that a 
slightly longer open reading frame was more generally 
found. A re -examination of the sequence from the lambda 
phage -derived 2 00 kDa gene confirmed the GTG start codon 
and an upstream stretch of 10 G nucleotides in a G 
tract. However, when sequence analysis was performed on 
4223 genomic PCR-amplif ied subclones, the longer open 
reading frame was found starting from an ATG codon. The 
G-tract was found to contain 9 G nucleotides in the 
chromosomal gene. An additional G nucleotide had been 
inserted during cloning from the phage library. Analysis 
of the 5' end of the 200 kDa gene from 24 strains 



suggests that the number of G nucleotides in the G tract 
acts as regulator of expression. 

Utilizing the techniques described herein, the 
genes encoding the about 2 00 kDa protein from M. 
catarrhalis strains Q8 and LES-1 have been cloned and 
sequenced. Figures 4 and 5 show respectively the 
nucleotide and derived amino acid sequences. An amino 
acid sequence comparison of the derived amino acid 
sequences of the 2 00 kDa protein from the three strains 
of M. catarrhal is is contained in Figure 6. 

Based on the sequence information, a plasmid 
(pKS2 94) was constructed that contained the full-length 
200 kDa protein gene of strain 4223 starting at the ATG 
codon, under control of the bacteriophage T7 promoter. 
However, even a basal level of expression of the full- 
length gene from the ATG was lethal to E. coli. Deletion 
of a 165 bp 5 ! fragment of the 200 kDa coding region 
greatly reduced the toxicity of the resultant protein to 
E. coli. Plasmid pKS348 contains ' the T7 promoter 
transcriptionally driving a 2 00 kDa protein gene which 
starts at amino acid residue 56. The V56 codon was 
changed to M56. The M56 r200 kDa protein was produced 
and the purified protein was used to generate guinea pig 
antiserum. 

In WO 96/34960, a bactericidal antibody assay was 
described that was used to demonstrate that anti-200 kDa 
antibody was bactericidal for M. catarrhalis. The assay 
was used herein to demonstrate broad bactericidal 
antibody activity against heterologous clinical isolates 
from different geographical locations, by anti-M56 r200 
kDa antibody. A single anti-M56 r200 kDa antibody was 
lytic for 62% of strains tested. 

The 200 kDa protein was originally identified as a 
putative adhesin when its presence was detected in a 
clumping strain, but not a non-clumping derivative. In 
order to determine whether it were truly an adhesin, an 
in vitro adherence assay was developed in which the 
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inhibition of binding by antibody between M. catarrhalis 
and epithelial cells was measured. Using this assay, 
anti-M56 r200 kDa antibody was capable of inhibiting 
adherence of the homologous strain by 4 8%, demonstrating 
5 that the 2 00 kDa protein was an adhesin. When an 
additional 25 strains of M. catarrhalis were assayed, 21 
were found to have reduced adherence to epithelial cells 
in the presence of anti-M56 r200 kDa antibody. 19 of 
these strains had not been killed by the same antibody. 
10 Thus, a single anti-M56 r200 kDa antibody was capable of 
killing or blocking adherence of 91% of the strains 
tested. 

The sequence comparison for the 2 00 kDa gene from 
three strains of M, catarrhalis showed that the C- 

15 terminal half of the protein was quite conserved. Strain 
LES-1 contained an insert of about 3 00 amino acids. 
Thus, based upon the C-terminal region, the strains may 
be divided into two families depending upon whether they 
contained the insert 4223 and Q8 formed one family while 

20 LES-1 formed the other. The carboxy terminal halves (3 1 
halves) of the 4223 or LES-1 200 kDa genes were 
expressed in E. coli with good yields and the purified 
carboxy terminal half of the proteins were used to 
generate antibodies. When tested in the bactericidal 

2 5 antibody assay, these antisera were bactericidal, as 

seen in Table IB. 

It is clearly apparent to one skilled in the art, 
that the various embodiments of the present invention 
have many applications in the fields of vaccination, 

3 0 diagnosis, treatment of Moraxella infections, and in the 

generation of immunological reagents. A further non- 
limiting discussion of such uses is further presented 
below. 

1. Vaccine Preparation and Use 

35 Immunogenic compositions, including those suitable 

to be used as vaccines, may be prepared from the about 
200 kDa outer membrane protein as disclosed herein, as 
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well as immunological fragments and fusions thereof, 
which may be purified from the bacteria or which may be 
produced recombinantly . The vaccine elicits an immune 
response in a subject which produces antibodies, 
5 including anti-200 kDa outer membrane protein antibodies 
and antibodies that are opsonizing or bactericidal. 
Should the vaccinated subject be challenged by Moraxella 
or other bacteria that produce proteins capable of 
producing antibodies that specifically recognize 2 00 kDa 
10 outer membrane protein, the antibodies bind to and 
inactivate the bacterium. Furthermore, opsonizing or 
bactericidal anti-200 kDa outer membrane protein 
antibodies may also provide protection by alternative 
mechanisms . 

15 Immunogenic compositions including vaccines may be 

prepared as injectables, as liquid solutions or 
emulsions. The about 200 kDa outer membrane protein may 
be mixed with pharmaceutical^ acceptable excipients 
which are compatible with the about 200 kDa outer 

2 0 membrane protein. Such excipients may include, water, 

saline, dextrose, glycerol, ethanol, and combinations 
thereof. The immunogenic compositions and vaccines may 
further contain auxiliary substances, such as wetting or 
emulsifying agents, pH buffering agents, or adjuvants to 
25 enhance the effectiveness thereof. Immunogenic 
compositions and vaccines may be administered 
parenterally, by injection subcutaneously or 
intramuscularly. Alternatively, the immunogenic 
compositions formed according to the present invention, 

3 0 may be formulated and delivered in a manner to evoke an 

immune response at mucosal surfaces. Thus, the 
immunogenic composition may be administered to mucosal 
surfaces by, for example, the nasal or oral 
(intragastric) routes. Alternatively, other modes of 
3 5 administration including suppositories and oral 
formulations may be desirable. For suppositories, 
binders and carriers may include, for example, 
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polyalkalene glycols or triglycerides. Oral formulations 
may include normally employed incipients such as, for 
example, pharmaceutical grades of saccharine, cellulose 
and magnesium carbonate. These compositions can take 
5 the form of solutions, suspensions, tablets, pills, 
capsules, sustained release formulations or powders and 
contain about 1 to 95% of the about 2 00 kDa outer 
membrane protein. The immunogenic preparations and 
vaccines are administered in a manner compatible with 

10 the dosage formulation, and in such amount as will be 
therapeutically effective, protective and immunogenic. 
The quantity to be administered depends on the subject 
to be treated, including, for example, the capacity of 
the individual's immune system to synthesize antibodies, 

15 and if needed, to produce a cell-mediated immune 
response. Precise amounts of active ingredient required 
to be administered depend on the judgement of the 
practitioner. However, suitable dosage ranges are 
readily determinable by one skilled in the art and may 

2 0 be of the order of micrograms of the about 2 00 kDa outer 
membrane protein. Suitable regimes for initial 

administration and booster doses are also variable, but 
may include an initial administration followed by 
subsequent administrations. The dosage may also depend 

2 5 on the route of administration and will vary according 

to the size of the host. 

The immunogenic preparations including vaccines may 
comprise as the immunostimulating material a nucleotide 
vector comprising at least a portion of the gene 

3 0 encoding the about 2 00 kDa protein, or the at least a 

portion of the gene may be used directly for 
immunization. 

The concentration of the about 2 00 kDa outer 
membrane antigen in an immunogenic composition according 
35 to the invention is in general about 1 to 95%. A 
vaccine which contains antigenic material of only one 
pathogen is a monovalent vaccine. Vaccines which 
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contain antigenic material of several pathogens are 
combined vaccines and also belong to the present 
invention. Such combined vaccines contain, for example, 
material from various pathogens or from various strains 
5 of the same pathogen, or from combinations of various 
pathogens . 

Immunogenicity can be significantly improved if the 
antigens are co-administered with adjuvants, commonly 
used as 0.05 to 0.1 percent solution in phosphate- 

10 buffered saline. Adjuvants enhance the immunogenicity 
of an antigen but are not necessarily immunogenic 
themselves. Adjuvants may act by retaining the antigen 
locally near the site of administration to produce a 
depot effect facilitating a slow, sustained release of 

15 antigen to cells of the immune system. Adjuvants can 
also attract cells of the immune system to an antigen 
depot and stimulate such cells to elicit immune 
responses . 

Immunostimulatory agents or adjuvants have been 

2 0 used for many years to improve the host immune responses 

to, for example, vaccines. Intrinsic adjuvants, such as 
lipopolysaccharides, normally are the components of the 
killed or attenuated bacteria used as vaccines. 
Extrinsic adjuvants are immunomodulators which are 
25 typically non-covalently linked to antigens and are 
formulated to enhance the host immune responses. Thus, 
adjuvants have been identified that enhance the immune 
response to antigens delivered parenterally . Some of 
these adjuvants are toxic, however, and can cause 

3 0 undesirable side-effects, making them unsuitable for use 

in humans and many animals. Indeed, only aluminum 
hydroxide and aluminum phosphate (collectively commonly 
referred to as alum) are routinely used as adjuvants in 
human and veterinary vaccines. The efficacy of alum in 
3 5 increasing antibody responses to diphtheria and tetanus 
toxoids is well established and a HBsAg vaccine has been 
adjuvanted with alum. While the usefulness of alum is 
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well established for some applications, it has 
limitations. For example, alum is ineffective for 
influenza vaccination and inconsistently elicits a cell 
mediated immune response. 
5 A wide range of extrinsic adjuvants can provoke 

potent immune responses to antigens. These include 
saponins complexed to membrane protein antigens (immune 
stimulating complexes) , pluronic polymers with mineral 
oil, killed mycobacteria in mineral oil, Freund's 

10 complete adjuvant, bacterial products, such as muramyl 
dipeptide (MDP) and lipopolysaccharide (LPS) , as well as 
lipid A, and liposomes. 

To efficiently induce humoral immune responses 
(HIR) and cell -mediated immunity (CMI) , immunogens are 

15 typically emulsified in adjuvants. Many adjuvants are 
toxic, inducing granulomas, acute and chronic 
inflammations (Freund's complete adjuvant) FCA, 
cytolysis (saponins and Pluronic polymers) and 
pyrogenicity, arthritis and anterior uveitis (LPS and 

2 0 MDP) . Although FCA is an excellent adjuvant and widely 

used in research, it is not licensed for use in human or 
veterinary vaccines because of its toxicity. 

Desirable characteristics of ideal adjuvants 
include : 
25 (1) lack of toxicity; 

(2) ability to stimulate a long-lasting immune 
response ; 

(3) simplicity of manufacture and stability in long- 
term storage; 

3 0 (4) ability to elicit both CMI and HIR to antigens 

administered by various routes, if required; 

(5) synergy with other adjuvants; 

(6) capability of selectively interacting with 
populations of antigen presenting cells (APC) ; 

35 (7) ability to specifically elicit appropriate T H 1 or 
T H 2 cell-specific immune responses; and 
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(8) ability to selectively increase appropriate 
antibody isotype levels (for example, IgA) against 
antigens . 

U.S. Patent No. 4,855,283 granted to Lockhoff et al 
5 on August 8, 1989 which is incorporated herein by 
reference thereto, teaches glycolipid analogues 
including N-glycosylamides , N-glycosyl ureas and N- 
glycosylcarbamates, each of which is substituted in the 
sugar residue by an amino acid, as immuno- modulators or 

10 adjuvants. Thus, Lockhoff et al . (US Patent No. 
4,855,283 and ref . 27) reported that N-glycolipid 
analogs displaying structural similarities to the 
naturally- occurring glycolipids, such as 

glycosphospholipids and glycoglycerolipids, are capable 

15 of eliciting strong immune responses in both herpes 
simplex virus vaccine and pseudorabies virus vaccine. 
Some glycolipids have been synthesized from long chain- 
alkylamines and fatty acids that are linked directly 
with the sugars through the anomeric carbon atom, to 

2 0 mimic the functions of the naturally occurring lipid 

residues . 

U.S. Patent No. 4,258,029 granted to Moloney, 
assigned to the assignee hereof and incorporated herein 
by reference thereto, teaches that octadecyl tyrosine 
25 hydrochloride (OTH) functioned as an adjuvant when 
complexed with tetanus toxoid and formalin inactivated 
type I, II and III poliomyelitis virus vaccine. Also, 
Nixon-George et al . (ref. 24), reported that octadecyl 
esters of aromatic amino acids complexed with a 

3 0 recombinant hepatitis B surface antigen, enhanced the 

host immune responses against hepatitis B virus. 

Lipidation of synthetic peptides has also been used 
to increase their immunogenicity . Thus, Wiesmuller 
(ref. 25) describes a peptide with a sequence homologous 
3 5 to a foot-and-mouth disease viral protein coupled to an 
adjuvant tripalmityl-S-glyceryl-cysteinylserylserine, 
being a synthetic analogue of the N-terminal part of the 
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lipoprotein from Gram negative bacteria. Furthermore, 
Deres et al . (ref. 26) reported in vivo priming of 
virus-specific cytotoxic T lymphocytes with synthetic 
lipopeptide vaccine which comprised of modified 
5 synthetic peptides derived from influenza virus 
nucleoprotein by linkage to a lipopeptide, N-palmityl-S- 
2, 3 -bis (palmitylxy) - (2RS) -propyl- [R] -cysteine (TPC) . 
2 . Immunoassays 

The about 2 00 kDa outer membrane protein of the 

10 present invention is useful as an immunogen for the 
generation of anti-200 kDa outer membrane protein 
antibodies, as an antigen in immunoassays including 
enzyme -linked immunosorbent assays (ELISA) , RIAs and 
other non-enzyme linked antibody binding assays or 

15 procedures known in the art for the detection of anti- 
bacterial, anti-Moraxella, and anti-200 kDa outer 
membrane protein antibodies. In ELISA assays, the about 
2 00 kDa outer membrane protein is immobilized onto a 
selected surface, for example, a surface capable of 

2 0 binding proteins such as the wells of a polystyrene 
microtiter plate. After washing to remove incompletely 
adsorbed about 200 kDa outer membrane protein, a 
nonspecific protein such as a solution of bovine serum 
albumin (BSA) that is known to be antigenically neutral 

2 5 with regard to the test sample may be bound to the 

selected surface. This allows for blocking of 

nonspecific adsorption sites on the immobilizing surface 
and thus reduces the background caused by nonspecific 
bindings of antisera onto the surface. 

3 0 The immobilizing surface is then contacted with a 

sample, such as clinical or biological materials, to be 
tested in a manner conducive to immune complex 
(antigen/ antibody) formation. This may include diluting 
the sample with diluents, such as solutions of BSA, 
35 bovine gamma globulin (BGG) and/or phosphate buffered 
saline (PBS)/Tween. The sample is then allowed to 
incubate for from 2 to 4 hours, at temperatures such as 
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of the order of about 2 0° to 3 7°C. Following 
incubation, the sample-contacted surface is washed to 
remove non-immunocomplexed material. The washing 

procedure may include washing with a solution, such as 
5 PBS/Tween or a borate buffer. Following formation of 
specific immunocomplexes between the test sample and the 
bound about 2 00 kDa outer membrane protein, and 
subsequent washing, the occurrence, and even amount, of 
immunocomplex formation may be determined by subjecting 

10 the immunocomplex to a second antibody having 
specificity for the first antibody. If the test sample 
is of human origin, the second antibody is an antibody 
having specificity for human immunoglobulins and in 
general IgG. To provide detecting means, the second 

15 antibody may have an associated activity such as an 
enzymatic activity that will generate, for example, a 
colour development upon incubating with an appropriate 
chromogenic substrate. Quantification may then be 
achieved by measuring the degree of colour generation 

2 0 using, for example, a visible spectrophotometer. 
3 . Use of Sequences as Hybridization Probes 

The nucleotide sequences of the present invention, 
comprising the sequence of the about 2 00 kDa protein 
gene, now allow for the identification and cloning of 

2 5 the about 2 00 kDa protein gene from any species of 

Moraxella. 

The nucleotide sequences comprising the sequence of 
the about 2 00 kDa protein gene of the present invention 
are useful for their ability to selectively form duplex 

3 0 molecules with complementary stretches of other about 

200 kDa protein genes. Depending on the application, a 
variety of hybridization conditions may be employed to 
achieve varying degrees of selectivity of the probe 
toward the other genes. For a high degree of 

35 selectivity, relatively stringent conditions are used to 
form the duplexes, such as low salt and/or high 
temperature conditions, such as provided by 0.02 M to 
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0.15 M NaCl at temperatures of between about 50°C to 
70°C. For some applications, less stringent 
hybridization conditions are required such as 0.15 M to 
0.9 M salt, at temperatures ranging from between about 
5 20°C to 55°C. Hybridization conditions can also be 
rendered more stringent by the addition of increasing 
amounts of formamide, to destabilize the hybrid duplex. 

Thus, particular hybridization conditions can be 
readily manipulated, and will generally be a method of 

10 choice depending on the desired results. In general, 
convenient hybridization temperatures in the presence of 
50% formamide are: 42°C for a probe which is 95 to 100% 
homologous to the target fragment, 37°C for 90 to 95% 
homology and 32°C for 85 to 90% homology. 

15 In a clinical diagnostic embodiment, the nucleic 

acid sequences of the about 2 00 kDa protein genes of the 
present invention may be used in combination with an 
appropriate means, such as a label, for determining 
hybridization. A wide variety of appropriate indicator 

20 means are known in the art, including radioactive, 
enzymatic or other ligands, such as avidin/biotin and 
digoxigenin-labelling, which are capable of providing a 
detectable signal. In some diagnostic embodiments, an 
enzyme tag such as urease, alkaline phosphatase or 

2 5 peroxidase, instead of a radioactive tag may be used. In 

the case of enzyme tags, colorimetric indicator 
substrates are known which can be employed to provide a 
means visible to the human eye or 

spectrophotometrically, to identify specific 

3 0 hybridization with samples containing about 2 00 kDa 

protein gene sequences . 

The nucleic acid sequences of the about 2 00 kDa 
protein genes of the present invention are useful as 
hybridization probes in solution hybridizations and in 
35 embodiments employing solid-phase procedures. In 
embodiments involving solid-phase procedures, the test 
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DNA (or RNA) from samples, such as clinical samples, 
including exudates, body fluids (e. g., serum, amniotic 
fluid, middle ear effusion, sputum, bronchoalveolar 
lavage fluid) or even tissues, is adsorbed or otherwise 
5 aff ixed to a selected matrix or surface. The fixed, 
single-stranded nucleic acid is then subjected to 
specific hybridization with selected probes comprising 
the nucleic acid sequences of the about 2 00 kDa protein 
encoding genes or fragments or analogs thereof of the 

10 present invention under desired conditions. The selected 
conditions will depend on the particular circumstances 
based on the particular criteria required depending on, 
for example, the G+C contents, type of target nucleic 
acid, source of nucleic acid, size of hybridization 

15 probe etc. Following washing of the hybridization 
surface so as to remove non- specif ically bound probe 
molecules, specific hybridization is detected, or even 
quantified, by means of the label. It is preferred to 
select nucleic acid sequence portions which are 

2 0 conserved among species of Moraxella. The selected 
probe may be at least 18bp and may be in the range of 
about 3 0 to 90 bp. 

4. Expression of the about 200 kDa Protein Gene 

Plasmid vectors containing replicon and control 

2 5 sequences which are derived from species compatible with 

the host cell may be used for the expression of the 
genes encoding the about 2 00 kDa protein in expression 
systems. The vector ordinarily carries a replication 
site, as well as marking sequences which are capable of 

3 0 providing phenotypic selection in transformed cells. For 

example, E. coli may be transformed using pBR322 which 
contains genes for ampicillin and tetracycline 
resistance and thus provides an easy means for 
identifying transformed cells. The plasmids or phage, 
3 5 must also contain, or be modified to contain, promoters 
which can be used by the host cell for expression of 
its own proteins. 
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In addition, phage vectors containing replicon and 
control sequences that are compatible with the host can 
be used as a transforming vector in connection with 
these hosts. For example, the phage in lambda GEM™- 11 
5 may be utilized in making recombinant phage vectors 
which can be used to transform host cells, such as E. 
coli LE3 92. 

Promoters commonly used in recombinant DNA 
construction include the p- lactamase (penicillinase) and 

10 lactose promoter systems and other microbial promoters, 
such as the T7 promoter system as described in U.S. 
Patent No. 4,952,4 96. Details concerning the nucleotide 
sequences of promoters are known, enabling a skilled 
worker to ligate them functionally with genes. The 

15 particular promoter used will generally be a matter of 
choice depending upon the desired results. Hosts that 
are appropriate for expression of the about 2 00 kDa 
protein genes, fragments, analogs or variants thereof, 
may include E. coli, Bacillus species, Haemophilus, 

2 0 fungi, yeast, Bordetella, or the baculovirus expression 
system may be used. 

In accordance with this invention, it is preferred 
to make the protein by recombinant methods, particularly 
when the naturally occurring about 2 00 kDa protein as 

2 5 purified from a culture of a species of Moraxella may 

include trace amounts of toxic materials or other 
contaminants. This problem can be avoided by using 
recombinantly produced protein in heterologous systems 
which can be isolated from the host in a manner to 
30 minimize contaminants in the purified material. 
Particularly desirable hosts for expression in this 
regard include Gram positive bacteria which do not have 
LPS and are, therefore, endotoxin free. Such hosts 
include species of Bacillus and may be particularly 

3 5 useful for the production of non-pyrogenic about 2 00 kDa 

protein, fragments or analogs thereof. 
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BIOLOGICAL DEPOSITS 
Certain plasmids that contain portions and full- 
length of the gene having the open reading frame of the 
gene encoding the about 2 00 kDa outer membrane protein 
5 of M. catarrhal is strain 4223 that are described and 
referred to herein have been deposited with the America 
Type Culture Collection (ATCC) located at 10801 
University Blvd., Manassas, VA 20110-2209, U.S.A., 
pursuant to the Budapest Treaty and pursuant to 3 7 CFR 
10 1.808 and prior to the filing of this application. 

Samples of the deposited plasmids will become 
available to the public upon grant of a patent based 
upon this United States patent application or relevant 
precursor applications. The invention described and 
15 claimed herein is not to be limited in scope by plasmids 
deposited, since the deposited embodiment is intended 
only as an illustration of the invention. Any 
equivalent or similar plasmids that encode similar or 
equivalent antigens as described in this application are 

2 0 within the scope of the invention. 

Plasmid ATCC Designation Date Deposited 
pKS47 97,111 April 7, 1995 

pKS5 97,110 April 7, 1995 

pKS9 97,114 April 18, 1995 

25 pKS294 203,528 December 17, 1998 

pKS348 203,529 December 17, 1998 

EXAMPLES 

The above disclosure generally describes the 
present invention. A more complete understanding can be 

3 0 obtained by reference to the following specific 

Examples. These Examples are described solely for 
purposes of illustration and are not intended to limit 
the scope of the invention. Changes in form and 
substitution of equivalents are contemplated as 
35 circumstances may suggest or render expedient. Although 
specific terms have been employed herein, such terms are 
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intended in a descriptive sense and not for purposes of 
limitations . 

Methods of molecular genetics, protein 
biochemistry, and immunology used but not explicitly 
5 described in this disclosure and these Examples are 
amply reported in the scientific literature and are well 
within the ability of those skilled in the art. 
Example 1 

This Example describes the cloning of a gene 
10 encoding the M. Cd.izci2r2rh.cili s 2 00 kDa outer membrane 
protein. 

A M. catarrhal is genomic library in phage lambda 
EMBL3 was prepared as described in Example 9 of USP 
5,808,024 and WO 96/34960 and was screened using guinea 

15 pig anti-200 kDa protein antiserum. A lambda phage clone 
811, which expressed an about 200 kDa protein, was 
confirmed by immunoblotting of the phage lysate using 
the about 200 kDa outer membrane -specific antiserum. 

Plate lysate cultures of this recombinant phage 

2 0 were prepared. The DNA was extracted from the plate 
lysates using a Wizard Lambda Preps DNA Purification 
System (Promega Corp, Madison, WI) according to the 
manufacturer's instructions. This phage clone carried 
a DNA insert of about 16 kb in size (the restriction map 

2 5 for which is shown in Figure 1) . The phage DNA was 
digested with a mixture of the restriction enzymes Sail 
and Xhol, and separated by agarose gel electrophoresis. 
Two DNA bands, approximately 5 kb and 11 kb in size, 
respectively, were cut out from the gel and extracted 

30 using a Geneclean kit (BIO 101 Inc., LaJolla, CA) 
according to the manufacturer's direction. 

The smaller 5 kb fragment was ligated into a 
plasmid vector, pBluescript II SK +/- (Stratagene 
Cloning Systems, LaJolla, CA) , which had been previously 

35 digested with Sail and Xhol, to produce plasmid pKS5. 
The larger 11 kb fragment was ligated into a plasmid 
vector, pSP72 (Promega Corp., Madison, WI) , digested 
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with Sail and Xhol , , to produce plasmid pKS9. Both 
ligated plasmids were used to transform E, calx, strain 
DH5cc - 

The lambda phage DNA was also digested with a 
5 mixture of Xhol and Kpnl and the approximately 1.1 kb 
fragment was isolated after agarose gel separation as 
described above. This 1.1 kb fragment was ligated into 
a plasmid vector, pGEM-7Zf(+) (Promega Corp., Madison, 
WI) , to produce plasmid pKS47. 
10 Example 2 

This Example describes the isolation of chromosomal 
DNA from AT. catarrhal is for use in PCR amplification. 

M. catarrhalis was cultured in 2 5 ml of BHI broth 
overnight and centrifuged at 5,000 rpm for 10 min. The 
15 bacteria pellet was suspended in 10 ml of 10 mM Tris/HCl 
(pH 8.0) containing 100 mM EDTA and mixed with RNaseA 
(final concentration: 100 jag/ml) and lysozyme (final 
concentration: 1 mg/ml) . After incubation on ice for 10 
min and at room temperature for 5 0 min, the suspension 

2 0 was gently mixed with 1 ml of 10% SDS and then heated at 

65°C for 2 0 min. The suspension was mixed with 

proteinase K (final concentration: 2 00 jxg/ml) and 

incubated at 50°C for 1 h. The suspension was gently 
mixed with 10 ml chloroform on a nutator for 15 min and 
25 centrifuged at 5,000 rpm for 10 min. The upper phase was 
slowly removed with a wide -bore pipette and mixed with 
10 ml of Tris-saturated phenol and 10 ml of chloroform 
on a nutator. After centrif ugation at 5,000 rpm for 10 
min, the upper phase was re-extracted with a mixture of 

3 0 Tris-saturated phenol and chloroform, again, and then 

extracted with chloroform, and then twice dialyzed 
against 1M NaCl at 4°C and twice against TE buffer (pH 
8.0) at 4°C. 
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Example 3 

This Example describes subcloning and sequence 
analysis of fragments of the 2 00 kDa protein gene from 
M. catarrhal is strain 4223. 

5 The procedures used to produce a phage A.EMBL3 clone 

811, and its subclones, pKS5, pKS9 and pKS47, are 
described in USP 5,808,024 and WO 96/34960. pKSlO was 
constructed from the A.EMBL3 clone 811 exactly as 
described for pKS9. pKS59 and pKS63 were constructed by 

10 insertion of a 1.4 kb Xbal-Ncol fragment of pKS9 into 
pGEM5Z( + ) that had been digested with Ncol and Spel . 
pKS71 was made by insertion of the same 1.4 kb Xbal-Ncol 
fragment, isolated from the XEMBL3 clone 811 into 
pGEM5Z(+). Sequence analysis confirmed that all three 

15 plasmids, pKS59, pKS63 and pKS71, carried identical DNA 
fragments. Figure 1 shows partial restriction maps for 
the plasmids. 

The full sequence of the 200 kDa gene locus from 
the A,DNA clone was described in USP 5,808,024 and WO 

20 96/34960 and is shown in Figure 2. There is a tract of 
10 consecutive G nucleotides between position 623 and 
632 in clones derived from the X library. The first 
possible start codon is, therefore, located at 
nucleotides 706 to 708 and is a GTG encoding a valine, 

25 boxed lightly in Figure 2. A series of strains 
expressing a 200 kDa gene, were identified by immunoblot 
analysis and the 5 r end of their 200 kDa genes was PCR 
amplified and sequenced. A summary of the findings is 
shown in Table 5 wherein the expression level of the 

3 0 gene appeared to be related to the number of G 
nucleotides in the tract and for those strains within 
higher expression levels, the start codon was an ATG 
upstream of the GTG codon identified from the 4223 X 
clones. Based upon these findings, the sequence of the 

35 5 r end of the 200 kDa gene from strain 4223 was re- 
examined . 
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Plasmids pKS9 and pKSlO were directly derived from 
the X clone. The subclones pKS59 and pKS63 were derived 
from pKS9 whereas pKS71 contained the same fragment 
derived directly from the X clone. All of these plasmids 
5 contained 10 G nucleotides in the G tract, as described 
previously. To determine whether the X clone contained 
an extra G nucleotide or the strain itself contained an 
aberrant gene, PCR amplification of the region was 
performed from chromosomal DNA preparations and from the 

10 X subclones. The data in Table 3 show that PCR fragments 

of the X subclones all contained 10 G nucleotides. The 
data in Table 4, however, demonstrate that PCR fragments 
derived directly from chromosomal DNA, contain 9 G 
nucleotides in the tract. When the single extra G 
15 nucleotide is removed from the 200 kDa sequence of 
strain 4223, the open reading frame is extended in the 
5' direction to start from an ATG codon 156 nucleotides 
earlier, at positions 541 to 543 in Figure 2 . This new 
start codon corresponds to that suggested for the 2 00 

2 0 kDa genes sequenced from other strains and summarized in 

Table 5. 
Example 4 

This Example describes the construction of the full 
length 200 kDa protein gene from M. catarrhalis strain 
25 4223. The construction scheme is shown in Figure 9. 

The full-length 200 kDa protein gene was 
constructed from the new ATG start codon identified by 
analysis of the chromosomally derived DNA as described 
in Example 3 and shown in Figure 3. pKS47 was digested 

3 0 with Xhol and Kpnl and separated by agarose gel 

electrophoresis. The 1.1 kb fragment was isolated from 
the gel and inserted into pKS5, which had previously 
been digested with the same two enzymes and purified to 
form pKS80. An about 5.8 kb PstI fragment from pKS80 was 
35 inserted into pT7-7 vector (ref. 28) that had been 
digested with PstI and dephosphorylated. The orientation 
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of the insert was determined by restriction enzyme 
analysis and pKS122 was chosen for further construction 
(see Figure 7) . 

The 5' region of the 200 kDa protein gene was 
5 amplified from strain 4223 chromosomal DNA. PCR 
reactions were performed using Taq Plus or Tsg Plus 
enzyme (Sangon Ltd., Scarborough, Ont . , Canada) and a 
Perkin Elmer DNA Thermocycler (Perkin Elmer Cetus, 
Foster City, CA, USA) . The lower PCR reaction mixture 

10 (50 nl) contained 5 jal of 10X buffer, 0.4 mM each of 
four deoxynucleotide triphosphates (Perkin Elmer, Foster 
City, CA, USA) and 1 to 2 each of two primers. The 

upper PCR reaction mixture (50 pJYO contained 5 jliI of 10X 
buffer, 0 . 5 to 1 ^1 of Taq Plus or Tsg Plus enzyme, and 

15 template DNA. The lower and upper mixtures were 
separated by a layer of AmpliWax PCR Gem5 0 (Perkin 
Elmer, Foster City, CA, USA) before heating cycles 
started . The thermocycling condition employed for the 
provision of PCR products in the construction of various 

2 0 plasmids are set forth in Table 11 below. The PCR 
products were purified using a QIAquick PCR purification 
kit (Qiagen Inc., Mississauga, Ont., Canada). The 
purified PCR products were sequenced on both strands 
directly and/or after cloning in appropriate vectors 

2 5 using an Applied Biosystem sequencer. 

The 5 ! primer (designated 5295. KS) was designed, so 
that it contained the first possible translation start 
codon, ATG, and its flanking sequences with a mutation 
to introduce an Ndel site at the ATG. The 3 ! primer 
30 (designated 4260. KS) was based upon the non-coding 
strand in the region about 1 kb downstream from the ATG 
start codon. (The nucleic acid sequences and SEQ ID'S of 
the PCR primers utilized herein are identified in Table 
10) . The PCR-product was digested with Ndel and an 

3 5 approximately 65 0 bp DNA fragment was gel purified and 
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inserted into pKS122, which had previously been 
linearized with Ndel and dephosphorylated. 

The new construct, designated pKS294 (Figure 8), 
was confirmed by restriction enzyme analyses and by 
5 sequencing of the PCR-amplif ied DNA and its joint 
regions. The number of G nucleotides in the G tract was 
nine, and the open reading frame continued from the 
newly found translation start codon, ATG, to the 
remaining portion of 200 kDa protein gene in pKS122 . 
10 pKS294, therefore, carried the correct, full-length 200 
kDa protein gene from Moraxella cata.rrha.lis strain 4223. 
During construction of pKS294, E. coli strain DH5 a was 
used for transformation and plasmid analyses. 
Example 5 

15 This Example describes the cloning and sequence 

analysis of genes encoding the 2 00 kDa protein from 
additional M. catarrhalis clinical isolates. 

A panel of M. catarrhalis clinical isolates was 
analysed by immunoblot with guinea pig anti-200 kDa 

20 antibody, as described in USP 5,808,024 and WO 96/34960. 
From these analyses, it was evident that there is size 
heterogeneity among the 2 00 kDa proteins from various 
strains. In order to assess the possible genetic 
heterogeneity, representative strains were chosen for 

2 5 gene cloning. Strain Q8 is a naturally occurring 
relatively non-clumping strain that produces a 2 00 kDa 
protein of about the same size as the 4223 -derived 
protein. Strain LES-1 produces a larger 200 kDa protein. 
These strains were also selected based upon bactericidal 

30 antibody data as illustrated in Table 1. The 200 kDa 
genes were cloned from these two strains of M. 
catarrhalis and sequenced. 

The nucleotide and derived amino acid sequences of 
the 2 00 kDa genes from strains Q8 and LES-1 are shown in 

35 Figures 4 and 5 respectively. An alignment of the amino 
acid sequences with the 4223 -derived sequence is shown 
in Figure 6. As can be seen, the first 68 residues of 
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the N-terminus are quite conserved, especially between 
strains 4223 and Q8 . In addition, the final 456 residues 
of the C-terminus are nearly identical among the three 
strains. The remainder of the sequence has regions of 
5 high homology and significant diversity, including an 
insert of more than 300 residues for strain LES-1. 

The N- terminal sequence of the 2 00 kDa proteins is 
homologous to the H. influenzae Hia and Hsf proteins, as 
well as other high molecular weight proteins or 

10 adhesins, such as AIDA (ref. 33) . 

The C-terminal region also has some homology to H. 
influenzae Hia and Hsf proteins as do some stretches of 
internal sequence. There is also some homology in the C- 
terminal region to UspA (ref. 23). A further indication 

15 of the relatedness of this family of proteins, is the 
finding that guinea pig anti-2 0 0 kDa antibody raised to 
gel-purified native protein was able to recognize 
recombinant Hia protein by immunoblot . This data has 
been described in copending United States Patent 

20 Application No. 09/268,347 (Hia) filed March 16, 1999, 
assigned to the assignee hereof and the disclosure of 
which is incorporated herein by reference. 
Example 6 

This Example shows the expression of the full- 

25 length about 200 kDa protein from pKS294. 

E. coli strain, BL21 (DE3 ) /pLysS was transformed by 
electroporation with pKS294, prepared as described in 
Example 4, for the expression study of the full-length 
200 kDa protein gene. 

3 0 The product of the pKS2 94 construct was found to be 

toxic to the host E. coli. At room temperature, the 
BL21 (DE3) /pLysS transf ormants grew very slowly on LB- 
agar plates containing ampicillin (Amp) and 
chloramphenicol (Cm) and at 37°C, no transf ormants were 

3 5 detected. When the transf ormants which grew at room 
temperature, were cultured overnight at 3 0°C on BHI agar 
containing the two antibiotics and glucose, they grew 
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well, producing colonies with a normal size. However, 
when these clones were cultured overnight in liquid 
medium at 3 0°C, subcultured into broth without glucose, 
and then induced by addition of IPTG, no recombinant 
5 protein was found on Western blot using anti-2 0 0 kDa 
protein serum. When the cells cultured overnight were 
examined before subculturing, a small quantity of 
recombinant 2 00 kDa protein was detected by SDS-PAGE 
stained with Coomassie Blue and by Western blot, showing 
10 that the gene was expressed during the overnight 
culture . 

When E. coli strain, DH5a, which cannot express the 
gene under the control of a T7 promoter, was transformed 
with pKS294, the transf ormants grew well at 37°C both on 

15 LB-agar and in LB-broth containing the antibiotics. 
These results suggest that the gene product is very 
toxic to host E. coli, and that even a basal level of 
expression of the full-length 200 kDa protein gene from 
the ATG is lethal to E. coli. 

2 0 AT. cata.rrha.lis strain LES-1 also produced similar 

toxicity in E. coli when the full length 200 kDa protein 
was expressed. 
Example 7 

This Example describes the deletion of a short 5 1 - 
25 sequence from the strain 4223 or strain LES-1 200 kDa 
protein gene and expression of the truncated genes 
producing a M56 r200 kDa product. 

The deletion of a short 5' region from the 4223 200 
kDa protein gene is shown in Figure 10 and was performed 
30 using a similar approach as described in Example 4. An 
about 500 bp 5' region of the 2 00 kDa gene was PGR 
amplified from strain 4223 using primers 5471. KS and 
4257. KS (Table 8) from chromosomal DNA. The 5 1 primer 
(designated 54 71.KS) was based upon the region 
35 surrounding the previously identified GTG downstream 
start codon. In primer 5471. KS, the flanking regions 
around the GTG codon were incorporated and the GTG was 
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mutated to ATG with further mutations used to introduce 
an Ndel site incorporating the new ATG . Using numbering 
from the full-length 200 kDa protein, the new start 
codon would be M56 replacing the previous V56 codon. The 
5 3' primer (designated 4257. KS) was based upon the non- 
coding strand located about 50 0 bp downstream from the 
GTG codon in the 200 kDa protein gene. The PCR-product 
was digested with Ndel , purified using a QIAquick PCR 
purification kit (Qiagen Inc., Mississauga, Ont . ) , and 

10 inserted into JVdel digested and dephosphorylated pKS122 
to provide pKS348 (see Figure 7) . Plasmid pKS348 was 
confirmed by restriction enzyme analyses and by 
sequencing of the PCR-amplif ied DNA piece and its joint 
regions. The nucleotide sequence (SEQ ID No: 12) and the 

15 deduced amino acid sequence (SEQ ID No: 13) for the 5 1 - 
truncation contained in pKS348 are shown in Figure 8. A 
similar N- terminal truncated 2 00 kDa gene from strain 
LES-1 was generated in the same manner and was 
designated pKS444 . 

20 A single colony of E. coli, BL21 (DE3) /pLysS, 

(KS358) which carried pKS348, was suspended in 5 ml of 
BHI broth containing Amp (100 |oM) , Cm (50 jJVQ and 0.4% 
of glucose, and cultured overnight at 3 7°C. To study the 
kinetics of expression, 2.5 ml of the overnight culture 

25 was added to 250 ml of LB (Luria-Bertani) broth 
containing Amp (100 (iM) and Cm (50 jjJVI) , and grown with 
shaking at 37°C to A 600 = 0 . 33 to 0.36. Another 0.3 ml of 
the overnight culture was added to 3 0 mL of LB broth 
containing Amp (100 p,M) and Cm (50 jJVE) and grown with 

30 shaking at 37°C to A 600 = 0 . 26 to 0.44. Gene expression 
from the cultures was induced by addition of IPTG (final 
concentration: 4 mM) . The bacteria were grown and 
harvested at different time points by centrif ugation . 
The expression of the 2 00 kDa protein gene in the 

35 culture was confirmed by SDS-PAGE analysis using 
Coomassie Blue staining and by Western blot analysis 
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using guinea pig anti-2 00 kDa protein serum, as 
described in USP 5,808,024 and WO 96/34960. 

When E. coli BL21 (DE3) /pLysS was transformed with 
pKS34 8, transformants grew well even on LB agar plates 
5 and in LB broth containing antibiotics at 3 7°C. After 
induction with IPTG, these clones produced a large 
amount of the N- terminally truncated r2 0 0 kDa protein 
which was clearly seen by SDS-PAGE Coomassie Blue stain, 
as shown in Figure 12 . 

!0 The bacterial culture induced at A 600 = 0 . 26 

produced slightly more truncated r200 kDa protein than 
the culture induced when the OD reading was 0.44. The 
largest amount of truncated r2 0 0 kDa protein was seen at 
3 hr after induction. Similar results were observed for 

15 the M56 r200 kDa expression from strain LES-1. 
Example 8 

This Example describes the purification of the M56 
r200 kDa proteins from strain 4223 or LES-1, according 
to the procedure shown in Figure 11. 
20 E. coli cell pellets were obtained from 500 ml 

culture prepared as described in Example 7, by 
centrifugation and were resuspended in 50 ml of 50 mM 
Tris-HCl, pH 8.0, containing 0.1 M NaCl, and disrupted 
by sonication. The sonicate was centrifuged at 20,000 xg 

2 5 for 3 0 min. and the resultant supernatant (supl) was 

discarded. The pellet (pptl) was extracted, in 50 ml of 
50 mM Tris-HCl, pH 8 . 0 containing 0.5% Triton X-100 and 
10 mM EDTA, then centrifuged at 20,000 xg for 30 min. 
and the supernatant (sup2) was discarded. The pellet 

3 0 (ppt2) was further extracted in 5 0 ml of 5 0 mM Tris-HCl, 

pH 8.0, containing 1% octylglucoside , then centrifuged 
at 2 0,000 xg for 3 0 min. and the supernatant (sup3) was 
discarded. 

The resultant pellet (ppt3) contained the inclusion 
35 bodies. The pellet was solubilized in 6 ml of 50 mM 
Tris-HCl, pH 8.0, containing 6 M guanidine and 5 mM DTT. 
Twelve ml of 50 mM Tris-HCl, pH 8.0 was added, the 
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mixture centrifuged at 20,000 xg for 30 min, and the 
pellet (ppt4) discarded. The supernatant (sup4) was 
precipitated by adding polyethylene glycol (PEG) 4000 at 
a final concentration of 5% and incubated at 4°C for 3 0 
5 min. The resultant pellet (ppt5) was removed by 
centrifugation at 20, 000 xg for 30 min. The supernatant 
was then precipitated by (NH 4 ) 2 S0 4 at 50% saturation at 

4°C overnight. After the addition of (NH 4 ) 2 S0 4 , the 
solution underwent phase separation with protein going 

10 to the upper phase (as judged by the cloudiness of the 
layer) . The upper phase was collected, then subjected to 
centrifugation at 20,000 xg for 30 min. The resultant 
pellet was collected and dissolved in 2 ml of 50 mM 
Tris-HCl, pH 8.0, containing 6 M guanidine and 5 mM DTT. 

15 The clear solution was purified on a Superdex 200 gel 
filtration column equilibrated in 50 mM Tris-HCl, pH 
8.0, containing 2 M guanidine HC1 . The fractions were 
analysed by SDS-PAGE and those containing the purified 
r2 0 0 kDa were pooled. The pooled fraction was 

2 0 concentrated 5 to 10 fold using a centriprep 3 0 and then 

dialysed overnight at 4°C against PBS, and centrifuged 
at 20,000 xg for 30 min to clarify. 

The protein remained soluble under these conditions 
and glycerol was added to the M56 r2 0 0 kDa preparation 
25 at a final concentration of 20% for storage at -20°C 
(Figure 12) . The average yield of the purified M56 r200 
kDa protein is about 10 mg L" 1 culture. The purified 
protein was used for the immunization of animals, as 
described below. 

3 0 The procedure of this Example 8 and was repeated 

for M. ca.tarrha.lis strain LES-1 and a corresponding r200 
kDa protein was produced. The N- terminal truncated M56 
r2 00 kDa protein from strain LES-1 gave approximately 
the same recovery of purified protein as described above 
35 for strain 4223. 
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Example 9 

This Example illustrates the immunogenicity of the 
M56 r2 0 0 kDa protein. 

The immunogenicity of M56 r200 kDa, prepared as 
5 described in Example 8, was examined using mice and 
guinea pigs. Groups of five BALB/c mice (Charles River, 
Quebec) were immunized sub-cutaneously (s.c.) on days 1, 
29 and 43 with 0.3, 1.3 and 10 jug of 4223 M56 r200 kDa 
antigen, prepared as described in Example 8, in the 
10 presence A1P0 4 (1.5 mg per dose) . Blood samples were 
collected on days 0, 14, 28, 42 and 56. 

Groups of five guinea pigs (Charles River, Quebec) 
were immunized i.m. on days 1, 29 and 43 with 25, 50 and 
100 ^g of 4223 M56 r2 0 0 kDa antigen prepared as 
15 described in Example 8, in the presence A1P0 4 (1.5 mg 
per dose) . Blood samples were collected on days 0, 14, 
28, 42 and 56. 

Anti-M56 r200 kDa IgG titers were determined by 
antigen-specific enzyme-linked immunosorbent assays 
20 (EIAs) . Microtiter wells (Nunc-MAXISORP, Nunc, Denmark) 
were coated with 50 jj,L of protein antigen 0.2 jjg mL" 1 ) . 
The reagents used in the assays were as follows: 
affinity-purified F(ab f )2 fragments of goat anti-mouse 
IgG (Fc-specif ic) conjugated to horseradish peroxidase 

2 5 (Jackson ImmunoResearch Labs, Mississauga, Ontario) ; 

affinity-purified guinea pig anti-IgG antibody (1 p,g ml" 
2 ) (prepared by the inventors) ; and affinity-purified 
F(ab r )2 fragment of goat anti-guinea pig IgG (H+L) 
antibodies conjugated to horseradish peroxidase (HRP) 

3 0 (Jackson ImmunoResearch Laboratories) used as a 

reporter. The reactions were developed using 
tetramethylbenzidine (TMB/H 2 0 2 , ADI , Mississauga, 
Ontario) and absorbancies were measured at 450 nm (using 
54 0 nm as a reference wavelength) in a Flow Multiskan 
35 MCC microplate reader (ICN Biomedicals, Mississauga, 
Ontario) . The reactive titer of an antiserum was defined 
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as the reciprocal of the dilution consistently showing a 
two- fold increase in absorbance over that obtained with 
the pre-bleed serum sample. 

The mice generated dose -dependent anti-M56 r200 kDa 
5 antibody responses, as shown in Figure 14. These 
results clearly show that the protein remained 
immunogenic after inclusion bodies extraction, 
solubilization and purification. Only a slight 
difference in the antibody titers were found for the 
10 higher dose range tested in guinea pigs (Figure 15) , 
indicating that the amount of antigen used was nearly at 
saturation. 
Example 10 

This Example describes the generation of hyper- 
15 immune sera against the M56 r200 kDa proteins in rabbits 
and guinea pigs . 

To generate hyper- immune sera against M56 r2 00 kDa 
proteins, groups of two rabbits and two guinea pigs 
(Charles River, Quebec) were immunized intramuscularly 
2 0 (i.m.) on day 1 with a 5 jag dose of purified M56 r2 0 0 
kDa protein, prepared as described in Example 8, 
emulsified in complete Freund's adjuvant (CFA) . Animals 
were boosted on days 14 and 29 with the same dose of 
protein emulsified in incomplete Freund's adjuvant 

2 5 (IFA) . Blood samples were taken on day 42 for analyzing 

the anti-M56 r200 kDa antibody titers and bactericidal 
activities. Anti-r200 kDa IgG titers were determined by 
antigen- specific enzyme-linked immunosorbent assays 
(EIAs) , as described in Example 9. The results obtained 

3 0 in the two animals using r2 0 0 kDa protein from strains 

4223 and LES-1 are illustrated in Table 6. 
Example 11 

This Example describes a bactericidal antibody 
assay. 

35 The bactericidal antibody activity of guinea pig 

anti-M56 r200 kDa sera from 4223 or LES-1 protein 
prepared as described in Example 10 against various 
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strains of M. catarrhal is was estimated using a 
viability plating assay. Each test strain of M. 
catarrhal! s was cultured overnight in brain heart 
infusion (BHI) broth (Difco Laboratories, Detroit, MI) 
5 at 3 7°C. The overnight culture was subcultured into 10 
ml BHI broth, and grown to an absorbance at 578 nm of 
0.5. The number of bacteria at A 578 = 0 . 5 changes from 
strain to strain. Therefore, several ten-fold dilutions 
of each strain were used in order to achieve 100 to 3 00 

10 colonies per plate for the preimmune serum group. 
Bacteria were diluted in Veronal buffered saline (VBS, 
pH 7.6) containing 140 mM NaCl, 93 mM NaHC0 3 , 2 mM Na- 
barbiturate, 4 mM barbituric acid, 0.5 mM MgCl 2 .6H 2 0, 
0.4 mM CaCl 2 .2H 2 0, and 0.1% bovine serum albumin. Guinea 

15 pig anti-M56 r2 0 0 kDa serum and pre -immune control serum 
were heated at 56°C for 3 0 min. to inactivate endogenous 
complement. Serum and antiserum were diluted in VBS, and 
placed on ice. 

Twenty- five |j,l of diluted pre -immune serum or test 

2 0 antiserum were added to the wells of a 96-well Nunclon 

microtitre plate (Nunc, Roskilde, Denmark) . Twenty- five 

jal of diluted bacterial cells were added to each of the 
wells. A guinea pig complement (BioWhittaker , 
Walkerville, MD) was diluted 1:10 in VBS, and 2 5 |il 
25 portions were added to each well. The plates were 
incubated for 60 min, gently shaking at 70 rpm on a 
rotary platform. Fifty jil of each reaction mixture were 
plated onto Mueller Hinton agar plates (Becton- 
Dickinson, Cockeysville, MD) . The plates were incubated 

3 0 at 3 7°C for 24 hours, and then left at room temperature 

for a further 24 hours. The number of colonies per plate 
was counted, and average values of colonies per plate 
were estimated from duplicate pairs. 

When pre -immune serum plates were compared with PBS 
35 control plates (no serum) , pre -immune serum had no 
bactericidal effect on the homologous strain 4223 . 
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Therefore, it was assumed that the number of colonies 
per plate on pre-immune serum plates represented 100% 
viability for each strain and percent bactericidal 
killing was calculated as follows: 



100%- j average number of colonies per plate in anti-r20Q kDa antiserum group x 100 



% 



average number of colonies per plate in pre-immune serum group 
5 When the bactericidal antibody activity of the 4223 

anti-M56 r2 0 0 kDa antiserum was examined against the 
homologous strain (Table 7) , 50% killing was observed at 
a serum dilution between 1/512 and 1/1024, showing that 
the antiserum raised against M56 r200 kDa protein 
10 possesses bactericidal antibody activity. Next, the 
bactericidal antibody activity of the antiserum was 
tested at a dilution of 1/64 against a total of 55 
different strains, which were isolated from otitis media 
patients in various geographical locations (Table IB) . 
15 The antiserum raised against the M56 r200 kDa protein 
from strain 4223 showed more than 3 0% bactericidal 
antibody activity against 38 out of 56 (68%) strains 
examined. When LES-1 anti-M56 r2 00 kDa antibody was 
tested in the bactericidal antibody assay, 36/55 (65%) 

2 0 strains were killed, including 11 strains that were not 

killed by the 4223 anti-M56 r200 kDa antibody. Only six 
strains out of 55 strains examined were not killed by 
either one of the two antisera. These results indicate 
that the 2 00 kDa protein is a very good candidate for 
25 inclusion in an otitis media vaccine. 
Example 12 

This Example describes the inhibition of binding of 
M. catarrhalis strains to either Chang or Hep- 2 
epithelial cells by 4223 anti-M56 r200 kDa serum. 

3 0 The 200 kDa protein had previously been proposed to 

be an adhesin on the basis of its apparent absence from 
a spontaneous non-clumping variant of strain 4223. This 
strain, obtained by serial passaging of culture 
supernatant s, was designated RH4 0 8 and is described in 
35 WO 96/34960. Electron microcopy also suggested that the 
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200 kDa protein was an adhesin. The sequence homology 
demonstrated between the M. catarrhal is 200 kDa proteins 
and other high molecular weight adhesins from different 
organisms, also suggested that it was an adhesin. Based 
5 upon these observations, an assay was developed to try 
to demonstrate that anti-r2 00 kDa antibody could block 
adherence between M. cata.rirha.lis and epithelial cells, 
thus identifying it definitively as an adhesin. 

On day 1, 24 well tissue culture plates were seeded 

10 with approximately 3 x 10 s Chang cells per well, to 
achieve a confluent monolayer following overnight 
incubation at 37°C in the presence of 5% C0 2 . M. 
catarrhal is 422 3 or Q8 was cultured in 10 ml of BHI 
broth at 3 7°C for 18 hr, shaking at 2 00 rpm. 

15 On day 2, bacterial cultures were pelleted by 

centrifugation at 3500 rpm for 10 min, and washed with 
10 ml of PBS. After a centrifugation as above, each 
pellet was resuspended in 2 ml of DMEM supplemented with 
10% FBS and 2 mM glut amine . The bacteria cultures were 

2 0 diluted 1/10 in the supplemented DMEM to OD of 

approximately 1.8 at 578 nm. Confluent monolayers of 
Chang cells were washed once with 1 ml of PBS per well, 
and 0.5 ml of 10% BSA in PBS was added to each well as a 
blocking agent. Plates were incubated at 3 7°C for 3 0 min 
25 and monolayers were washed twice with PBS as above. 

A guinea pig ant i -4223 M56 r2 0 0 kDa antiserum, 
prepared as described in Example 10 and pooled pre- 

immune guinea pig sera were heated at 56°C for 3 0 min to 
inactivate endogenous complement. Equal volumes of 

3 0 appropriately diluted antisera and bacteria were mixed, 

and 2 00 jjlI of the mixture were added into each well. 
Examples of antiserum dilutions tested included 1/4, 
1/16 and 1/64. The plate was incubated at 37°C for 1 hr, 
with gentle shaking. The plate was carefully washed four 
3 5 times with 1 ml of PBS per well to remove the bacteria. 
To each well, 100 |ul of trypsin were added, and the 
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plate was incubated at 3 7°C for 5 min. After 
inactivation of trypsin by addition of 900 |al Dulbecco's 
Minimal Essential Medium (DMEM) to each well, the cells 
were resuspended by pipetting up and down several times. 
5 Ten- fold dilutions of resuspended cells were 

prepared in a new 96 -well plate. Fifty pi each of the 1 
x 10~ 2 , 1 x 10~ 3 , 1 x 10" 4 and 1 x 10" 5 diluted samples 
were plated on a Mueller-Hinton agar plate. Plates were 
incubated at 3 7°C overnight, and then left at room 

10 temperature for a further 24 hours. The number of 
colonies per plate was counted for the estimation of the 
total bound bacteria. 

Dilution plating was also carried out for each 
bacterial strain, to estimate bacterial concentrations 

15 and to calculate the total amount of bacteria added to 
each well. It was assumed that the number of bacteria 
bound to tissue culture cells in the presence of pre- 
immune sera represented 10 0% optimal binding for each 
assay, and 0% inhibition. Therefore, in order to 

2 0 calculate the percent inhibition of the antiserum, we 
used the following formula: 

% inhibition=100 - total bacteria bound in 4223 anti-r200 kDa antiserum samples x 100 
L total bacteria bound in pre-immune sera samples _ 

When the guinea pig 4223 anti-M56 r2 0 0 kDa protein 

serum was examined for the inhibition of binding of 

strain 4223 to Chang cells (Table 8) , inhibition of 98%, 

25 92% and 83% was observed at antiserum dilutions of 1/4, 
1/16 and 1/64, respectively. With the heterologous 
strain Q8, the inhibition of binding to the tissue 
culture cells was estimated to be 77%, 82% and 55% at 
antiserum dilutions of 1/4, 1/16 and 1/64, respectively. 

30 The results clearly showed that anti-M56 r200 kDa 
protein serum inhibited the binding of AT. catarrhalis to 
cultured human epithelial cells. 

Having demonstrated that 4223 anti-M56 r200 kDa 
antibody could block adherence of AT. catarrhalis strains 

35 4223 or Q8 to Chang epithelial cells in a dose -dependent 
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manner, the studies were extended to other strains. Of 
particular interest, were those strains that were not 
killed by anti-M56 r200 kDa antisera in the bactericidal 
antibody assay. To perform the in vitro adherence assay 
5 on several strains, a single antibody dilution of 1/16 
was used. The data for inhibition of in vitro adherence 
to Hep-2 cells is summarized in Table 9. The procedure 
for the Hep-2 epithelial cells was identical to the 
Chang cell procedure described above. The 4223 anti-M56 

10 r200 kDa antibody effectively blocked adherence of the 
homologous strain by 48%. Strain RH408 does not express 
the 2 00 kDa gene and in the assay, antibody inhibited 
adherence of RH408 to 9%. This would be assumed to be a 
background 1 evel . Of 20 strains tested, 16 were 

15 inhibited at rates higher than 9%. Among these strains 
were 19 strains that had not been killed by the 4223 
anti-M56 r2 00 kDa antibody. 

To summarize and as shown in Tables 1, 8 and 9, in 
our collection of 89 strains of Moraxella ca.tarrha.lis, 

20 80 express 200 kDa. Of 57 strains tested with 4223 anti- 
M56 r200 kDa antibody in the bactericidal antibody 
assay, 39 were killed (58%) . An additional 15 strains 
were inhibited from binding to epithelial cells by the 
same antibody for a total of 54 strains (95%) , against 

25 which a single antibody was effective. These data 
demonstrate the very high potential of r2 0 0 kDa proteins 
as vaccine antigens. 
Example 13 

This Example describes the sequence analysis of the 
30 200 kDa protein gene from M. catarrhalis strain RH408, 
the non-clumping variant of 4223 described in WO 
96/34960 . 

As described in Example 4 and Table 5, it appeared 
that the number of G nucleotides in the G tract had a 
35 regulatory function on the expression of the 200 kDa 
gene. M. catarrhalis strain 4223 and its non-clumping 
derivative RH408 appeared to differ only in the 
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expression of the 200 kDa gene. The 200 kDa gene from 
strain RH408 was subcloned and sequenced and its 
sequence compared to the parental gene from strain 4223. 
Four partially overlapping fragments of the 200 kDa 
5 protein gene were PCR amplified from strain AT. 
catarrhalis RH408, using primers illustrated in Figure 
16 and Table 10, under the conditions set out in Table 
11. The combined sequences of the four PCR products 
covered approximately 6.5 kb including the entire 200 

10 kDa protein gene and its flanking regions. When the 
sequence of the 6.5 kb fragment was compared with the 
sequence of the same region from its parent strain 4223, 
the only difference was the number of G nucleotides in 
the G tract. As described in Example 4, the correct 

15 number of G nucleotides in the G tract was nine. 
However, the number G nucleotides in the G tract of 
RH408 was only eight. 

This result, along with the analysis of this region 
in 24 other strains of M. catarrhalis (Table 5) strongly 

2 0 suggests that the number of G nucleotides in the G tract 
controls the expression of the 2 00 kDa gene in M. 
catarrhalis strains. Similar mechanisms of 

transcriptional control are found for other bacterial 
genes, such as the N. gonorrhehoae Pile gene (ref . 32) . 

2 5 Example 14 

This Example describes the generation of additional 
N- terminal truncated r2 0 0 kDa proteins and expression 
studies . 

As described in Example 6, the full-length r2 0 0 kDa 

3 0 protein appeared to be toxic to E. coli and could not be 

expressed under normal induction conditions. The M56 
r2 0 0 kDa proteins were readily expressed, as described 
in Example 7, and were subsequently shown to be highly 
promising vaccine candidates in in vitro assays 
3 5 (Examples 11 and 12) . The expression of r2 0 0 kDa 
proteins of intermediate length and their properties was 
studied. 
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Three additional N- terminal truncated 200 kDa genes 
were constructed from the 4223 200 kDa gene using the 
procedures described in Example 7. The sites of 
truncation were chosen based upon and are illustrated in 
5 Figure 17. The arrows in Figure 17 indicate the sites of 
truncation, namely ALA 12 , VAL 19 and GLY 39 , each modified 
to MET. A 5 1 fragment up to an internal site was PCR 
amplified using primers illustrated in Table 8. For the 
ALA 12 truncation, the primers were 5' 6242. ks and 3' 

10 4257. ks, for the VAL 19 truncation, the primers were 5' 
6243. ks and 3' 4257. ks and for the GL4 39 truncation, the 
primers were 5' 6244. ks and 3' 4257. ks (Table 10). The 
amplification conditions were the same as those used for 
pKS348 (Table 11) . The PCR products were restricted 

15 with Ndel and ligated into the Ndel sites of pKS348 for 
expression. While some expression of r2 00 kDa was 

obtained with each of the N-terminal truncations, the 
level did not approach the levels obtained using pKS348. 
Example 15 

2 0 This Example illustrates the construction of 

plasmids pQWE and pQWF expressing C-terminal fragments 
of the 200 kDa gene. 

As shown in the amino acid comparison of Figure 6, 
the carboxy half of the 2 00 kDa protein in quite 

2 5 conserved, the main difference being a large 

approximately 3 00 amino acid residue insert in strain 
LES-1. Since so much cross-reactivity for the anti-M56 
r2 0 0 kDa ant i sera had been observed, the conserved 
carboxy half of the protein was expressed. 

3 0 Plasmid pKS348 prepared as described in Example 7 

was digested with restriction enzymes, Nde I and Nae I, 
producing four fragments. The approximately 5.8 kb Nde 
1/Na.e I fragment containing the T7 promoter, ampicillin 
antibiotic resistance marker and the 3 ' end of the 200 
3 5 kDa gene was agarose gel purified. The approximately 
480 bp Nde I/Nde I fragment containing the 5' end of the 
2 00 kDa gene was also gel purified. This approximately 
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480 bp fragment was then restriction digested with the 
enzymes Nla IV and Pst I and the Nde X/Nla IV fragment 
ligated to the previously isolated 5.8 kb Nde I/Nae I 
fragment to produce plasmid pQWE, as illustrated in 
5 Figure 19. This plasmid construct contained a 200 kDa 
gene with the Nla IV to Nae I fragment deleted. This 
plasmid construct resulted, upon expression as described 
in Example 7, in a fusion 200 kDa protein containing a 
very short piece of the 5' end and the 3 r half of the 

10 2 00 kDa protein. 

An approximately 500 bp fragment around the Eco RI 
site in the 200 kDa gene from plasmid pKS348 was PCR 
amplified utilizing a 5' oligonucleotide, 6425. KS and a 
3' oligonucleotide 4272. KS (Table 10) using the 

15 conditions outlined in Table 11. The 5 1 oligonucleotide 
was synthesized with an ATG translational start codon 
and a Nde I restriction site, while the 3 T 
oligonucleotide was synthesized with an Eco RI site. The 
approximately 5 00 bp PCR fragment was the restriction 

2 0 digested with the enzymes Nde I and Eco RI . Plasmid 

pQWE, prepared as described above, was restriction 
digested with Nde I and Eco RI as illustrated in Figure 
20, and this larger fragment agarose gel purified. The 
Nde 1 1 Eco RI PCR fragment was then ligated into the 
25 isolated Nde I /Eco RI fragment from pQWE, to produce 
plasmid pQWF. This construct expresses a 5' truncated 
200 kDa protein, having only the 3' half of this protein 
from the region about 4 0 bp upstream of the Nde I site 
to the 3 1 end. 

3 0 The constructs pQWE and pQWF, prepared as described 

above and as illustrated in Figures 19 and 20, were 
expressed in E. coli strain BL21 (DE3 ) /pLysS as described 
in Example 7. The C-terminal half proteins were obtained 
at levels of expression approximately twice those 
35 achieved using pKS348. Corresponding constructs were 
prepared from strain LES-1 and produced comparable 
results . 



Antiserum was raised against the C-terminal half of 
2 00 kDa protein produced from construct pQWE following 
the procedure of Example 10 and was employed in the 
bactericidal assay described in Example 11. As may be 
seen in Table IB the antiserum showed more than 3 0% of 
killing against 30 out of 31 strains which were killed 
by the bactericidal assay using antiserum raised against 
the product from pKS348. 

SUMMARY OF THE DISCLOSURE 

In summary of this disclosure, nucleotide sequences 
encoding an about 2 00 kDa outer membrane protein from 
several strains of Moraxella catarrhalis are described 
along with recombinant production of such protein. 
Modifications are possible within the scope of this 
invention. 



46 

Table 1A 

Examination of 2 00 kDa protein in M. catarrhalis strains 



STRAIN 


ANATOMICAL 
ORIGIN 


SOURCE 


EXPRESSION OF 
200 kDa 
PROTEIN 


4223 


A /f TT\ T7 A T» TTT T TTT"\ 

MID. EAR rLUID 


T.F. MURPHY 


+++ 


T>TT/1 AO 


A yTT TT A XT HP /"\T7 /nil 

MU 1 AJN 1 Or 4223 




- 


D 


OTJT TTT TA ,f 

bPU I UM 




- 


JO 


bFU I UM 






ljj 


TV/n~r\ ~c a t> tjt Trrr\ 
NILL). HA_K rJLUID 


i 


+++ 




rS I JbivblVLLA. 




+ 


^ 1 Q1 


A jfTT\ CAT) TTT T TTT\ 

MID. bAR rLUID 


it 


-H-+ 


Q1QC 


XT A C r^TJTJr A T> ^TTvTV 

NASOPHARYNX 




-f-H- 


iVlZ 


OTVT TTT TA A 

&rU 1 UM 




-H-+ 


Mj 


OTYT TTT TA JT 
bPU 1UM 




~ 


A1CC25240 




ATCC 


- 


tt r\ /i 

H-04 


OTITIS 


G.D. CAMPBELL 


+++ 


TT 1 O 






- 


rU-34 






+++ 


PO-51 






+++ 


E-07 






+++ 


E-22 






+++ 


E-23 






+++ 


E-24 






+++ 


M-02 






+++ 


M-20 




« 


-HH- 


M-29 






+++ 


M-32 






+++ 


M-35 






+++ 


Q-2 


EXPECTORATION 


M.G. BERGERON 


+ 


Q-6 






- 


Q-8 






+++ 


0-9 






- 


Q-10 






+-H- 


Q-ll 






+-H- 


Q-12 






- 


R-l 


BRONCHIAL 
SECRETIONS 




+ 


R-2 








R-4 


OTTTT^ 

\J ±11 10 




i I r 


R-5 


<c 




+++ 


R-6 




a 


+++ 


R-7 




a 


+++ 


N-209 


BLOOD 


t 


+++ 


VH-1 


OTITIS 


V. HOWIE 


+++ 


VH-2 






+-H- 


VH-3 




n 


+++ 


VH-4 




tt 


+-H- 


VH-5 




<t 


+-H- 


VH-6 




tc 


+++ 


VH-7 




it 


+++ 



47 



VH-8 




4i 


1 1 1 


VH-9 




tt 


-\~\~{- 


VH-10 




41 


1 1 1 


VH-11 




tt 


+++ 


VH-12 




tt 


+++ 


VH-13 




44 




VH-14 




tt 


+++ 


V 11"! J 


„ 


it 


, [,.i,.„j_ 


VH-16 




Ct 


_) | |_ 


VH-17 




tt 


n I r 


V 1 J. 1 O 


„ 


tt 


n 1 r 


VH-1 Q 


u 


tt 


4-4-4- 

n 1 r 
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n 1 r 


V 1 1 J> 


(( 


tt 


_| 1 L 
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tt 
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„ 


tt 


+++ 


VH-26 




« 


+++ 
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« 


tt 


_] ] j_ 


V IT- Z.O 




tt 


_l i t_ 
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tt 
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OTTTT^ 




_| 1 L 


T F^9 
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_i i l 
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tt 


|_ 
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ct 


_| 1 L 
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(( 


(( 
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T F<s7 
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_j i i_ 
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(( 
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J 1 |_ 


T F<sQ 




tt 


1 1 1 
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(( 


tt 
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(( 


tt 


4-4-4- 

i I 1 
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tt 


| | | 
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tt 


_| l_4_ 

n 1 r 


T F<s17 


u 


(t 


_j 1 |_ 


LES21 




tt 


4-4_j_ 


30607 


OTITIS 


C.W. FORD 


+++ 


CJ1 




C. JOHNSON 


+++ 


CJ3 




tc 


+++ 


CJ4 




tt 


+++ 


CJ7 






-H-f- 


CJ8 




tt 


-H-+ 


CJ9 






+++ 


CJ11 




tt 


+++ 



Bacteria were lysed and proteins were separated on SDS-PAGE gels. 
The expression of 2 00 kDa protein was examined by Coomassie Blue 
staining and by Western blot using anti-2 00 kDa protein guinea pig 
serum. 
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TABLE IB 

Bactericidal assay results against Moraxella catarrhalis 
using antisera raised against recombinant M56 200 kDa 
protein from strains 4223 and LES1, and recombinant C- 
terminal half of 200 kDa protein from strain 4223. 



STRAIN 


Killed by anti- 


Killed by anti- 


Killed by anti- 




M56 200 kDa 


C- terminal half 


M56 200 kDa 




from 4223 


of 2 00 kDa from 
4223 


from LES1 


4223 




++ 


- 


135 


+ + 


++ 


++ 


H-04 


+ + 


++ 




H-12* 


- 


NT 


- 


PO-34 


- 


NT 


++ 


PO-51 


- 


NT 


- 


E-07 


- 


NT 


++ 


E-22 




++ 


- 


E-24 


- 


NT 


- 


M-02 


++ 


++ 




M-20 


++ 


+ 


- 


M-29 


++ 


++ 


++ 


M-32 


++ 


++ 


++ 


M-35 


++ 


++ 


++ 


R4 


- 


NT 


++ 


R5 


++ 


++ 


++ 


R6 


++ 


+ 


+ 


R7 


++ 


NT 


o 


Q8** 


++ 


+ 


NT 


VH-l 


++ 


NT 


++ 


VH-2 


++ 


NT 


+ + 


VH-4 


- 


NT 




VH-5 


++ 


++ 


- 


VH-7 


++ 


+ 




VH-8 


++ 


++ 


+ + 


VH-9 


- 


NT 


+ + 


VH-10 


++ 


++ 


++ 


VH-13 


- 


NT 


- 


VH-15 


++ 


++ 




VH-17 


- 


NT 


- 


VH-19 


++ 


+ + 


++ 


VH-20 


+ 


+ 


++ 


VH-23 


+ 


NT 


++ 


VH-24 


++ 


+ + 


- 


VH-25 


- 


NT 


++ 


VH-26 




NT 


++ 


VH-27 




NT 




VH-28 


+ 


NT 




VH-29 


++ 


++ 


++ 


VH-30 




NT 


++ 


LES1 




NT 


++ 


LES2 


++ 


++ 


+ 


LES4 


+ 


NT 


++ 


LESS 




NT 


++ 


LES9 




++ 


++ 


LES11 


+ 


+ 


+ 


LES12 




NT 


*? 


LES13 




NT 


++ 


LES16 


+ 


++ 


++ 


LES17 


+ + 


++ 




LES21 


++ 


++ 




30607 


+ 


NT 


++ 
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CJ1 


++ 


++ 


++ 


CJ3 


++ 




++ 


CJ4 


++ 


++ 


++ 


CJ7 


++ 


++ 


++ 


CJ8 


+ + 


++ 


? 



This strain does not produce 20 0 kDa protein. 



** This is the only non-otitis media strain (isolated from 
expectorate) in this Table. 

++: Killed more than 60% (>60%) , +: killed between 30% and 60%, 
-: killed 30% or less, NT: not tested, ?: the results not tested. 
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TABLE 2 



The number of G nucleotides in the G tract of the 200 
kDa protein gene determined by sequencing of subcloned 
genes from a A.EMBL3 clone. 



Plasmid* 


Number of G's 


pKSlO 


10 


pKS59 


10 


PKS63 


10 


PKS71 


10 



* pKSlO and pKS71 carried a DNA insert directly 
subcloned from a A,EMBL3 clone. pKS59 and pKS63 carried a 
subcloned DNA fragment, pKS9, which was a subclone from 

an XEMBL3 clone. pKS59, pKS63 and pKS71 carried 
identical DNA inserts. 



TABLE 3 

The number of G nucleotides in the G tract of the 2 00 
kDa protein gene amplified by PGR from subcloned genes 



Primers 


Template DNA 


Number of G's 


4211 and 4213 


pKS9 


10 


4211 and 4213 


pKSlO 


10 


4211 and 4213 


pKS71 


10 



* pKS9, pKSlO and pKS71, which contain a 5 1 fragment of 
the 200 kDa protein gene, were independently subcloned 
from the WSMBL3 clone . 
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TABLE 4 

The number of G nucleotides in the G tract of the 2 00 
kDa protein gene amplified by PCR from chromosomal DNA 
of strain 4223 



Primers 


Template 


Number of G 


4211 and 4166 


4223B 


9 


4211 and 4213 


4223B 


9 


4211 and 4213 


4223R 


9 



* The template chromosomal DNAs, 422 3B and 4223R, were 
independently prepared from M. catarrhal is strain 4223. 



TABLE 5 



The number of G nucleotides in the G tract in different 
strains of M. catarrhalis 



Expression 


Number of G 


Number of strains 
examined 


Possible start codon 


+++ 


3 


1 


ATG 


+++ 


6 


7 


ATG 


+++ 


S 


7 


ATG 


+ 


10 


3 


GTG 




7 


3 


GTG 




8 


2 


GTG 




9 


1* 


ATG 


Total 


24 





* The 2 00 kDa protein gene of this strain was 
prematurely terminated by a stop codon. 
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TABLE 6 

Anti-M56 r2 00 kDa antibody titers in guinea pig and rabbit 

sera 



ANTISERA 


i-iTy J. j-iD\ju , 

Against M56 r2 0 0 kDa 
(4223) 


Against MS 6 r200 kDa 
(LES-1) 


Gp anti-r200 kDa (4223) 


204,800 
409,600 


102,400 
409, 600 


Gp anti-r200 kDa (LES1) 


204,800 
102,400 


1, 638,400 
1,638,400 


Rb anti-r200 kDa (4223) 


102,400 
102,400 


102,400 
102,400 


Rb anti-r200 kDa (LES1) 


25,600 
102,400 


204, 800 
409,600 
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TABLE 7 

Killing of M. catarrhalis strain 4223 by the 
bactericidal antibody activity of guinea pig anti-M5 6 
r2 00 kDa protein serum 



Serum dilution 


1/64 


1/128 


1/256 


1/512 


1/1024 


Killing % 


97% 


95% 


95% 


80% 


38% 



* The guinea pig antiserum was raised against M56 r200 
kDa protein from strain 4223, and the bactericidal 
antibody activity of the serum at various dilutions were 
examined against the strain 4223. 



TABLE 8 



Inhibition of the binding of M. catarrhalis strains to 
Chang cells by guinea pig anti-M56 r200 kDa protein 
serum 



Strain 


1/4 


1/16 


1/64 


4223 


98% 


92% 


83% 


Q8 


77% 


82% 


55% 



* The guinea pig antiserum was raised against M56 r200 
kDa protein from strain 4223. 
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TABLE 9 



Inhibition of In vitro adherence of Moraxella catarrhalis to 
Hep-2 cells by antiserum raised against recombinant 200 kDa 
protein from strain 4223 



STRAIN 


Inhibition 


4223* 


+++ 


PO-34 


+++ 


PO-51 


++ 


E-07 


++ 




++ 


VH-4 


++ 


VH-9 




VH-13 


+ 


VH-17 


++ 


VH-23 


++ 


VH-25 


++ 


VH-26 


+++ 


VH-27 


+ 


VH-28 


+++ 


LES1 


++ 


LES4 




LES12 




LES13 




30607 


+ 



+++: Inhibition was 30% or higher, ++ : Inhibition was 20% to 
30%, +: Inhibition was 15% to 20%, -: Inhibition was 
lower than 15%. 

*: This strain is the positive control, and the only 
strain in this Table, which was killed by the 
bactericidal activity of ant i -recombinant 2 00 kDa 
protein serum. 
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TABLE 10 



Nucleotide sequences of primers used for PCR 
amplifications 



PRIMER 


NUCLEOTIDE SEQUENCE 


SEQ ID No: 


4211. KS 


GATGCCTACGAGTTGATTTGGGT 


14 


4213 .KS 


GAGCGTTGCACCGATCACGAGGA 


15 


4166. KS 


CACTAGCCTTTACATCACCACCGATG 


16 


5295 .KS 


AAGGTAAACCCATATGAATCACATCTATAAAGTCA 


17 


4260 .KS 


GCTTCTAGCTGTGCCACATTGA 


18 


5471 .KS 


CGCTCGCTGTCCATATGATCGGTGCAACGCTCA 


19 


4257 .KS 


GACCCTGTGCATATGACATGGCT 


20 


4254 .KS 


CCTTGGCATCAATCGTGGCACA 


21 


4 Z / b . Kb 


TTAC C TGC AT CAATGC CATTGT C T 


22 


4329. KS 


CTGAGGTGAATACAACTACA 


23 


4272 .KS 


CATCAGAGGTCTTTGAGGTGTCAT 


24 


4118 .KS 


CATCACCGTGGGTCAAAAGAACGCA 


25 


4267 .KS 


GATGTCGGCAATGTTTACCTGA 


26 


4269. KS 


CCACATTGACCAGTACTGGCACAGGTGCTA 


27 


4981 . KS 


AC CTATGATCAATGGCGATTTGGT 


28 


6425 . KS 


AAAGATCATATGGTTAC CTTTGGCATTAAC 


29 


6242 


GTCATCTTTCATATGGCCACAGGCACA 


30 


6243 


ACATTTATGCATATGGCAGAGTACGCCA 


31 


6244 


GCTACAGGGCATATGGGCAGTGTATGCACT 


32 
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TABLE 11 
PCR Cycle Conditions 

1. For the construction of pKS2 94, oligonucleotides 5295 
and 4260 and of pKS348, oligonucleotides 5471 and 4257: 
95°C for 2 min -> 95°C for 1 min, 60°C for 3 0 sec, 72°C 
for 1 min (10 cycles) 95°C for 1 min, 62°C for 30 
sec, 72°C for 1 min (20 cycles with extension of 1 
sec/cycle) 72°C for 10 min 4°C. 

2. For the construction of pQWF, oligonucleotides 6425 and 
4272 : 

95°C for 2 min 95°C for 1 min, 60°C for 30 sec, 72°C 
for 1 min (10 cycles) -> 95°C for 1 min, 60°C for 30 
sec, 72°C for 1 min (20 cycles with extension of 1 
sec/cycle) 72°C for 10 min 4°C. 

3. For the amplification of 700 bp fragment for sequencing 
the G-nucleotide tract from different strains, 
oligonucleotides 4211 and 4166. 

95°C for 2 min _> 95°C for 1 min, 60°C for 1 min, 72°C 
for 2 min (10 cycles) -> 95°C for 1 min, 60°C for 1 min, 
72°C for 2 min (20 cycles with extension of 5 sec/cycle) 

-> 72°C for 10 min -> 4°C. 

4. For sequencing 200 kDa protein from M. catarrhalis 
strain RH408, 

(a) oligonucleotides 4254 and 4278; 4118 and 4267; and 
4269 and 4981: 

95°C for 2 min _> 95°C for 1 min, 62°C for 3 0 sec, 72°C 
for 1 min (10 cycles) -> 95°C for 1 min, 62°C for 30 
sec, 72°C for 1 min (20 cycles with extension of 2 
sec/cycle) -» 72°C for 10 min — > 4°C. 

(b) oligonucleotides 4329 and 4272 

95°C for 2 min 95°C for 1 min, 58°C for 3 0 sec, 72°C 
for 1 min 30 sec (10 cycles) — > 95°C for 1 min, 58°C for 
3 0 sec, 72°C for 1 min 3 0 sec (20 cycles with extension 
of 1 sec/cycle) 72°C for 10 min -> 4°C. 
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CLAIMS 

What we claim is: 

1. An isolated and purified nucleic acid molecule 
having a nucleotide sequence selected from the group 
consisting of: 

(a) a nucleotide sequence set forth in Figure 3, 
4 or 5 (SEQ ID Nos : 5, 6, 8, 9, 11, 12) for 
Moraxella catarrhal is strains 4223, Q8 and LES-1 
respectively or the complementary sequence thereto, 

(b) a nucleotide sequence encoding an about 2 00 
kDa outer membrane protein of a strain of Moraxella 
catarrhal is and having the derived amino acid 
sequence shown in Figures 3, 4 or 5 (SEQ ID Nos: 7, 
10 , 13) for Moraxella catarrhalis strains 4223, Q8 
and LES-1 respectively, and 

(c) a nucleotide sequence encoding an about 200 
kDa outer membrane protein of another strain of 
Moraxella catarrhalis which is characterized by a 
tract of consecutive G nucleotides which is 3 or a 
multiple thereof in length, an ATG start codon 
about 8 0 to 90 bp upstream of said tract and said 
tract being located between about amino acids 2 5 
and 35 encoded by the nucleotide sequence. 

2 . The nucleic acid molecule of claim 1 wherein said 
another strain of Moraxella catarrhalis in (c) is a 
strain as identified in Table 1A other than strains 
4223, Q8 and LES-1 and expressing an about 200 kDa 
protein. 

3 . An isolated and purified nucleic acid molecule 
having a nucleotide sequence selected from the group 
consisting of: 

(a) a nucleotide sequence set forth in Figure 8 
(SEQ ID No: 12) for a 5 1 -truncation of the gene 

encoding an about 200 kDa outer membrane protein of 

Moraxella catarrhalis strain 4223, 



(b) a nucleotide sequence encoding the derived 
amino acid sequence set forth in Figure 9 (SEQ ID 
No: 13) for a N- terminal truncation of an about 2 00 
kDa outer membrane protein of Moraxella catarrhalis 
strain 4223, and 

(c) a nucleotide sequence encoding a 5 ! - 
truncation of a gene encoding an about 2 00 
kDa outer membrane protein of another strain 
of Moraxella catarrhalis and being capable of 
expressing the corresponding N-terminally 
truncated about 2 00 kDa outer membrane 
protein from E. coli. 

4. An isolated and purified nucleic acid molecule 
which is a contiguous Nde I - Pst I fragment of SEQ ID 
No: 5. 

5. A vector for transforming a host comprising a 
nucleic acid molecule as claimed in any one of claims 1 
to 4. 

6. The vector of claim 5 which is a plasmid vector. 

7. The vector of claim 5 which has the identifying 
characteristics of pKS348 (ATCC 203529) shown in Figure 
10 or pKS294 (ATCC 203528) shown in Figure 9. 

8. The vector of claim 5 which has the identifying 
characteristic of pQWE shown in Figure 19 or pQWF shown 
in Figure 20. 

9. A host cell transformed by a vector as claimed in 
claim 5 and expressing an about 2 00 kDa protein of a 
strain of Moraxella catarrhalis or an approximately C- 
terminal half thereof. 

10. The host cell of claim 9 which is E . coli. 

11. A recombinant about 200 kDa outer membrane protein 
of a strain of Moraxella catarrhalis or an approximately 
C-terminal half thereof producible by the transformed 
host of claim 9. 

12. The recombinant protein of claim 11 producible in 
inclusion bodies. 
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13. An immunogenic composition comprising the 
recombinant about 2 00 kDa outer membrane protein or an 
approximately C-terminal half thereof of claim 11. 

14. The immunogenic composition of claim 13 formulated 
as a vaccine for in vivo administration to protect 
against disease caused by Moraxella catarrhal is . 

15. The immunogenic composition of claim 13 in 
combination with a targeting molecule for delivery to 
specific cells of the immune system or to mucosal 
surfaces . 

16. The immunogenic composition of claim 13 formulated 
as a microparticle, capsule or liposome preparation. 

17. The immunogenic composition of claim 13 further 
comprising an adjuvant. 

18. A method of inducing protection against disease 
caused by Moraxella catarrhal! s, comprising 
administering to a susceptible host an effective amount 
of the immunogenic composition of claim 13 . 

19. The method of claim 18 wherein said susceptible 
host is a human. 

20. A method for the production of an about 2 00 kDa 
outer membrane protein of a strain of Moraxella 
catarrhalis or an approximately C-terminal half thereof, 
which comprises: 

transforming a host with a vector as claimed in 
claim 5, 

growing the host cell to express the encoded about 
200 kDa protein or an approximately C-terminal half 
thereof, and 

isolating and purifying the expressed about 200 kDa 
protein or an approximately C-terminal half thereof. 

21. The method of claim 20 wherein the host cell is E. 
coli . 

22. The method of claim 20 wherein said encoded about 
200 kDa protein is expressed in inclusion bodies. 
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23. The method of claim 22 wherein said isolation and 
purification of the expressed about 200 kDa protein is 
effected by: 

disrupting the grown transformed cells to produce a 
supernatant and the inclusion bodies, 

solubilizing the inclusion bodies to produce a 
solution of the recombinant about 2 00 kDa protein, 

chromatographically purifying the solution of 
recombinant about 2 00 kDa protein free from 
contaminating proteins, and 

isolating the purified recombinant about 200 kDa 
protein. 
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ABSTRACT OF THE DISCLOSURE 
An isolated and purified outer membrane protein of 
a Moraxella strain, particularly M. catarrhalis , having 
a molecular mass of about 2 00 kDa, is provided by 
5 recombinant means. The about 200 kDa outer membrane 
protein as well as nucleic acid molecules encoding the 
same are useful in diagnostic applications and 
immunogenic compositions, particularly for in vivo 
administration to a host to confer protection against 
10 disease caused by a bacterial pathogen that produces the 
about 2 00 kDa outer membrane protein or produces a 
protein capable of inducing antibodies in a host 
specifically reactive with the about 200 kDa outer 
membrane protein. 
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Figure 2. M. catarrhalis strain 4223 XEMBL3 clone 200kDa gene 
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gca acg etc agt ggc agt get tat get caa aaa aaa gat acc aaa cat 764 
Ala Thr Leu Ser Gly Ser Ala Tyr Ala Gin Lys Lys Asp Thr Lys His 
5 10 15 

ate gca att ggt gaa caa aac cag cca aga cgc tea ggc act gec aag 812 
He Ala He Gly Glu Gin Asn Gin Pro Arg Arg Ser Gly Thr Ala Lys 
20 25 30 35 

gcg gac ggt gat cga gec att get att ggt gaa aat get aac gca cag 860 
Ala Asp Gly Asp Arg Ala He Ala He Gly Glu Asn Ala Asn Ala Gin 
40 45 50 

ggc ggt caa gec ate gee ate ggt agt agt aat aaa act gtc aat gga 908 
Gly Gly Gin Ala He Ala He Gly Ser Ser Asn Lys Thr Val Asn Gly 
55 60 65 

age agt ttg gat aag ata ggt acc gat get acg ggt caa gag tec ate 956 
Ser Ser Leu Asp Lys He Gly Thr Asp Ala Thr Gly Gin Glu Ser He 
70 75 80 

gec ate ggt ggt gat gta aag get agt ggt gat gee teg att gec ate 1004 
Ala He Gly Gly Asp Val Lys Ala Ser Gly Asp Ala Ser He Ala He 
85 90 95 

ggt agt gat gac tta cat ttg ctt gat cag cat ggt aat cct aaa cat 1052 
Gly Ser Asp Asp Leu His Leu Leu Asp Gin His Gly Asn Pro Lys His 
100 105 no us 

ccg aaa ggt act ctg att aac gat ctt att aac ggc cat gca gta tta 1100 
Pro Lys Gly Thr Leu He Asn Asp Leu He Asn Gly His Ala Val Leu 
120 125 130 



aaa gaa ata cga age tea aag gat aat gat gta aaa tat aga cgc aca 1148 
Lys Glu He Arg Ser Ser Lys Asp Asn Asp Val Lys Tyr Arg Arg Thr 
135 140 145 

acc gca age gga cac gee agt act gca gtg gga gee atg tea tat gca 1196 
Thr Ala Ser Gly His Ala Ser Thr Ala Val Gly Ala Met Ser Tyr Ala 
150 155 160 

cag ggt cat ttt tec aac gee ttt ggt aca egg gca aca get aaa agt 1244 
Gin Gly His Phe Ser Asn Ala Phe Gly Thr Arg Ala Thr Ala Lys Ser 
165 170 175 

gee tat tec ttg gca gtg ggt ctt gee gec aca gec gag ggc caa tct 1292 
Ala Tyr Ser Leu Ala Val Gly Leu Ala Ala Thr Ala Glu Gly Gin Ser 
180 185 190 195 

aca ate get att ggt tct gat gca aca tct age teg ttg gga gcg ata 1340 
Thr He Ala He Gly Ser Asp Ala Thr Ser Ser Ser Leu Gly Ala He 
200 205 210 

gee ctt ggt gca ggt act cgt get cag eta cag ggc agt att gec eta 138 8 
Ala Leu Gly Ala Gly Thr Arg Ala Gin Leu Gin Gly Ser He Ala Leu 
215 220 225 

ggt caa ggt tct gtt gtc act cag agt gat aat aat tct aga ccg gec 1436 
Gly Gin Gly Ser Val Val Thr Gin Ser Asp Asn Asn Ser Arg Pro Ala 
230 235 240 

tat aca cca aat acc cag gca eta gac ccc aag ttt caa gee acc aat 1484 
Tyr Thr Pro Asn Thr Gin Ala Leu Asp Pro Lys Phe Gin Ala Thr Asn 
245 250 255 

aat acg aag gcg ggt cca ctt tec att ggt agt aac tct ate aaa cgt 1532 
Asn Thr Lys Ala Gly Pro Leu Ser He Gly Ser Asn Ser He Lys Arg 
260 265 270 275 

aaa ate ate aat gtc ggt gca ggt gtt aat aaa acc gat gcg gtc aat 15 8 0 
Lys He He Asn Val Gly Ala Gly Val Asn Lys Thr Asp Ala Val Asn 
280 285 290 

gtg gca cag eta gaa gcg gtg gtg aag tgg get aag gag cgt aga att 162 8 
Val Ala Gin Leu Glu Ala Val Val Lys Trp Ala Lys Glu Arg Arg He 
295 300 305 

act ttt cag ggt gat gat aac agt act gac gta aaa ata ggt ttg gat 1676 
Thr Phe Gin Gly Asp Asp Asn Ser Thr Asp Val Lys He Gly Leu Asp 
310 315 320 

aat act tta act att aaa ggt ggt gca gag acc aac gca tta acc gat 1724 
Asn Thr Leu Thr He Lys Gly Gly Ala Glu Thr Asn Ala Leu Thr Asp 
325 330 335 

aat aat ate ggt gtg gta aaa gag get gat aat agt ggt ctg aaa gtt 1772 
Asn Asn He Gly Val Val Lys Glu Ala Asp Asn Ser Gly Leu Lys Val 
340 345 350 355 

aaa ctt get aaa act tta aac aat ctt act gag gtg aat aca act aca 182 0 
Lys Leu Ala Lys Thr Leu Asn Asn Leu Thr Glu Val Asn Thr Thr Thr 



360 



365 



370 



tta aat gcc aca acc aca gtt aag gta ggt agt agt agt agt act aca 1868 
Leu Asn Ala Thr Thr Thr Val Lys Val Gly Ser Ser Ser Ser Thr Thr 
375 380 385 

get gaa tta ttg agt gat agt tta acc ttt acc cag ccc aat aca ggc 1916 
Ala Glu Leu Leu Ser Asp Ser Leu Thr Phe Thr Gin Pro Asn Thr Gly 
390 395 400 

agt caa age aca age aaa acc gtc tat ggc gtt aat ggg gtg aag ttt 1964 
Ser Gin Ser Thr Ser Lys Thr Val Tyr Gly Val Asn Gly Val Lys Phe 
405 410 415 

act aat aat gca gaa aca aca gca gca ate ggc act act cgt att acc 2 012 
Thr Asn Asn Ala Glu Thr Thr Ala Ala He Gly Thr Thr Arg He Thr 
420 425 430 435 

aga gat aaa att ggc ttt get cga gat ggt gat gtt gat gaa aaa caa 2 060 
Arg Asp Lys He Gly Phe Ala Arg Asp Gly Asp Val Asp Glu Lys Gin 
440 445 450 

gca cca tat ttg gat aaa aaa caa ctt aaa gtg ggt agt gtt gca att 210 8 
Ala Pro Tyr Leu Asp Lys Lys Gin Leu Lys Val Gly Ser Val Ala He 
455 460 465 

acc ata gac aat ggc att gat gca ggt aat aaa aag ate agt aat ctt 2156 
Thr He Asp Asn Gly He Asp Ala Gly Asn Lys Lys He Ser Asn Leu 
470 475 480 

gcc aaa ggt age agt get aac gat gcg gtt acc ate gaa cag etc aaa 22 04 
Ala Lys Gly Ser Ser Ala Asn Asp Ala Val Thr He Glu Gin Leu Lys 
485 490 495 

gcc gcc aag cct act tta aac gca ggc get ggc ate agt gtc aca cct 2252 
Ala Ala Lys Pro Thr Leu Asn Ala Gly Ala Gly He Ser Val Thr Pro 
500 505 510 515 

act gaa ata tea gtt gat get aag agt ggc aat gtt acc gcc cca act 23 00 
Thr Glu He Ser Val Asp Ala Lys Ser Gly Asn Val Thr Ala Pro Thr 
520 525 530 

tac aac att ggc gtg aaa acc acc gag ctt aac agt gat ggc act agt 2348 
Tyr Asn He Gly Val Lys Thr Thr Glu Leu Asn Ser Asp Gly Thr Ser 
535 540 545 

gat aaa ttt agt gtt aag ggt agt ggt acg aac aat age tta gtt acc 23 96 
Asp Lys Phe Ser Val Lys Gly Ser Gly Thr Asn Asn Ser Leu Val Thr 
550 555 560 

gcc gaa cat ttg gca age tat eta aat gaa gtc aat cga acg get gac 2444 
Ala Glu His Leu Ala Ser Tyr Leu Asn Glu Val Asn Arg Thr Ala Asp 
565 570 575 

agt get eta caa age ttt acc gtt aaa gaa gaa gac gat gat gac gcc 2492 
Ser Ala Leu Gin Ser Phe Thr Val Lys Glu Glu Asp Asp Asp Asp Ala 
580 585 590 595 

aac get ate acc gtg get aaa gat acg aca aaa aat gcc ggc gca gtc 2540 



Asn Ala lie Thr Val Ala Lys Asp Thr Thr Lys Asn Ala Gly Ala Val 
600 605 610 



age ate tta aaa etc aaa ggt aaa aac ggt eta acg gtt get ace aaa 2588 
Ser He Leu Lys Leu Lys Gly Lys Asn Gly Leu Thr Val Ala Thr Lys 
615 620 625 

aaa gat ggt acg gtt acc ttt ggg ctt age caa gat age ggt ctg ace 2636 
Lys Asp Gly Thr Val Thr Phe Gly Leu Ser Gin Asp Ser Gly Leu Thr 
630 635 640 

att ggc aaa age acc eta aac aac gat ggc ttg act gtt aaa gat acc 2 684 
He Gly Lys Ser Thr Leu Asn Asn Asp Gly Leu Thr Val Lys Asp Thr 
645 650 655 

aac gaa caa ate caa gtc ggt get aat ggc att aaa ttt act aat gtg 2732 
Asn Glu Gin He Gin Val Gly Ala Asn Gly He Lys Phe Thr Asn Val 
660 665 670 675 



aat ggt agt aat cca ggt act ggc att gca aat acc get cgc att acc 
Asn Gly Ser Asn Pro Gly Thr Gly He Ala Asn Thr Ala Arg He Thr 
680 685 690 



2780 



aga gat aaa att ggc ttt get ggt tct gat ggt gca gtt gat aca aac 2 82 8 
Arg Asp Lys He Gly Phe Ala Gly Ser Asp Gly Ala Val Asp Thr Asn 
695 700 705 

aaa cct tat ctt gat caa gac aag eta caa gtt ggc aat gtt aag att 2876 
Lys Pro Tyr Leu Asp Gin Asp Lys Leu Gin Val Gly Asn Val Lys He 
710 715 720 

acc aac act ggc att aac gca ggt ggt aaa gee ate aca ggg ctg tec 2 924 
Thr Asn Thr Gly He Asn Ala Gly Gly Lys Ala He Thr Gly Leu Ser 
725 730 735 

cca aca ctg cct age att gee gat caa agt age cgc aac ata gaa ctg 2 972 
Pro Thr Leu Pro Ser He Ala Asp Gin Ser Ser Arg Asn He Glu Leu 
740 745 750 755 

ggc aat aca ate caa gac aaa gac aaa tec aac get gee age att aat 3 02 0 
Gly Asn Thr He Gin Asp Lys Asp Lys Ser Asn Ala Ala Ser He Asn 
760 765 770 

gat ata tta aat aca ggc ttt aac eta aaa aat aat aac aac ccc att 3068 
Asp He Leu Asn Thr Gly Phe Asn Leu Lys Asn Asn Asn Asn Pro He 
775 780 785 

gac ttt gtc tec act tat gac att gtt gac ttt gee aat ggc aat gee 3116 
Asp Phe Val Ser Thr Tyr Asp He Val Asp Phe Ala Asn Gly Asn Ala 
790 795 800 

acc acc gee aca gta acc cat gat acc get aac aaa acc agt aaa gtg 3164 
Thr Thr Ala Thr Val Thr His Asp Thr Ala Asn Lys Thr Ser Lys Val 
805 810 815 

gta tat gat gtg aat gtg gat gat aca acc att cat eta aca ggc act 3212 
Val Tyr Asp Val Asn Val Asp Asp Thr Thr He His Leu Thr Gly Thr 
820 825 830 835 



gat gac aat aaa aaa ctt ggc gtc aaa acc acc aaa ctg aac aaa aca 3260 
Asp Asp Asn Lys Lys Leu Gly Val Lys Thr Thr Lys Leu Asn Lys Thr 
840 845 850 

agt get aat ggt aat aca gca act aac ttt aat gtt aac tct agt gat 33 08 
Ser Ala Asn Gly Asn Thr Ala Thr Asn Phe Asn Val Asn Ser Ser Asp 
855 860 865 

gaa gat gec ctt gtt aac gec aaa gac ate gec gaa aat eta aac acc 3356 
Glu Asp Ala Leu Val Asn Ala Lys Asp lie Ala Glu Asn Leu Asn Thr 
870 875 880 

eta gee aag gaa att cac acc acc aaa ggc aca gca gac acc gec eta 3404 
Leu Ala Lys Glu He His Thr Thr Lys Gly Thr Ala Asp Thr Ala Leu 
885 890 895 

caa acc ttt acc gtt aaa aag gta gat gaa aat aat aat get gat gac 3452 
Gin Thr Phe Thr Val Lys Lys Val Asp Glu Asn Asn Asn Ala Asp Asp 
900 905 910 915 

gee aac gee ate acc gtg ggt caa aag aac gca aat aat caa gtc aac 3500 
Ala Asn Ala He Thr Val Gly Gin Lys Asn Ala Asn Asn Gin Val Asn 
920 925 930 

acc eta aca etc aaa ggt gaa aac ggt ctt aat att aaa acc gac aaa 3548 
Thr Leu Thr Leu Lys Gly Glu Asn Gly Leu Asn He Lys Thr Asp Lys 
935 940 945 

aat ggt acg gtt acc ttt ggc att aac acc aca age ggt ctt aaa gec 3596 
Asn Gly Thr Val Thr Phe Gly He Asn Thr Thr Ser Gly Leu Lys Ala 
950 955 960 

ggc aaa age acc eta aac gac ggt ggc ttg tct att aaa aac ccc act 3 644 
Gly Lys Ser Thr Leu Asn Asp Gly Gly Leu Ser He Lys Asn Pro Thr 
965 970 975 

ggt age gaa caa ate caa gtc ggt get gat ggc gtg aag ttt gec aag 3 692 
Gly Ser Glu Gin He Gin Val Gly Ala Asp Gly Val Lys Phe Ala Lys 
980 985 990 995 

gtt aat aat aat ggt gtt gta ggt get ggc att gat ggc aca act cgc 3 74 0 
Val Asn Asn Asn Gly Val Val Gly Ala Gly He Asp Gly Thr Thr Arg 
1000 1005 1010 

att acc aga gat gaa att ggc ttt act ggg act aat ggc tea ctt gat 3788 
He Thr Arg Asp Glu He Gly Phe Thr Gly Thr Asn Gly Ser Leu Asp 
1015 1020 1025 

aaa age aaa ccc cac eta age aaa gac ggc att aac gca ggt ggt aaa 3 836 
Lys Ser Lys Pro His Leu Ser Lys Asp Gly He Asn Ala Gly Gly Lys 
1030 1035 1040 

aag att acc aac att caa tea ggt gag att gec caa aac age cat gat 3884 
Lys He Thr Asn He Gin Ser Gly Glu He Ala Gin Asn Ser His Asp 
1045 1050 1055 

get gtg aca ggc ggc aag att tat gat tta aaa acc gaa ctt gaa aac 3 932 
Ala Val Thr Gly Gly Lys He Tyr Asp Leu Lys Thr Glu Leu Glu Asn 
1060 1065 1070 1075 



aaa ate age agt act gee aaa aca gca caa aac tea tta cac gaa ttc 3980 
Lys He Ser Ser Thr Ala Lys Thr Ala Gin Asn Ser Leu His Glu Phe 
1080 1085 1090 

tea gta gca gat gaa caa ggt aat aac ttt acg gtt agt aac cct tac 4028 
Ser Val Ala Asp Glu Gin Gly Asn Asn Phe Thr Val Ser Asn Pro Tyr 
1095 1100 1105 

tec agt tat gac acc tea aag ace tct gat gtc ate acc ttt gca ggt 4076 
Ser Ser Tyr Asp Thr Ser Lys Thr Ser Asp Val He Thr Phe Ala Gly 
1110 1115 1120 

gaa aac ggc att acc acc aag gta aat aaa ggt gtg gtg cgt gtg ggc 4124 
Glu Asn Gly He Thr Thr Lys Val Asn Lys Gly Val Val Arg Val Gly 
1125 1130 1135 

att gac caa acc aaa ggc tta acc acg cct aag ctg acc gtg ggt aat 4172 
He Asp Gin Thr Lys Gly Leu Thr Thr Pro Lys Leu Thr Val Gly Asn 
H40 1145 1150 1155 

aat aat ggc aaa ggc att gtc att gac age caa aat ggt caa aat acc 4220 
Asn Asn Gly Lys Gly He Val He Asp Ser Gin Asn Gly Gin Asn Thr 
1160 1165 H70 

ate aca gga eta age aac act eta get aat gtt acc aat gat aaa ggt 42 68 
He Thr Gly Leu Ser Asn Thr Leu Ala Asn Val Thr Asn Asp Lys Gly 
H75 1180 1185 

age gta cgc acc aca gaa cag ggc aat ata ate aaa gac gaa gac aaa 4316 
Ser Val Arg Thr Thr Glu Gin Gly Asn He He Lys Asp Glu Asp Lys 
1190 H95 1200 

acc cgt gee gee age att gtt gat gtg eta age gca ggc ttt aac ttg 43 64 
Thr Arg Ala Ala Ser He Val Asp Val Leu Ser Ala Gly Phe Asn Leu 
1205 1210 1215 

caa ggc aat ggt gaa gcg gtt gac ttt gtc tec act tat gac acc gtc 4412 
Gin Gly Asn Gly Glu Ala Val Asp Phe Val Ser Thr Tyr Asp Thr Val 
1220 1225 1230 1235 

aac ttt gee gat ggc aat gee acc acc get aag gtg acc tat gat gac 4460 
Asn Phe Ala Asp Gly Asn Ala Thr Thr Ala Lys Val Thr Tyr Asp Asp 
1240 1245 1250 

aca age aaa acc agt aaa gtg gtc tat gat gtc aat gtg gat gat aca 4508 
Thr Ser Lys Thr Ser Lys Val Val Tyr Asp Val Asn Val Asp Asp Thr 
1255 1260 1265 

acc att gaa gtt aaa gat aaa aaa ctt ggc gta aaa acc acc aca ttg 4556 
Thr He Glu Val Lys Asp Lys Lys Leu Gly Val Lys Thr Thr Thr Leu 
1270 1275 1280 

acc agt act ggc aca ggt get aat aaa ttt gee eta age aat caa get 4604 
Thr Ser Thr Gly Thr Gly Ala Asn Lys Phe Ala Leu Ser Asn Gin Ala 
1285 1290 1295 

act ggc gat gcg ctt gtc aag gee agt gat ate gtt get cat eta aac 4652 
Thr Gly Asp Ala Leu Val Lys Ala Ser Asp He Val Ala His Leu Asn 



1300 



1305 



1310 



1315 



acc tta tct ggc gac ate caa act gec aaa ggg gca age caa gcg aac 4700 
Thr Leu Ser Gly Asp He Gin Thr Ala Lys Gly Ala Ser Gin Ala Asn 
1320 1325 1330 

aac tea gca ggc tat gtg gat get gat ggc aat aag gtc ate tat gac 4748 
Asn Ser Ala Gly Tyr Val Asp Ala Asp Gly Asn Lys Val He Tyr Asp 
1335 1340 1345 

agt acc gat aac aag tac tat caa gec aaa aat gat ggc aca gtt gat 47 96 
Ser Thr Asp Asn Lys Tyr Tyr Gin Ala Lys Asn Asp Gly Thr Val Asp 
1350 1355 1360 

aaa acc aaa gaa gtt gec aaa gac aaa ctg gtc gec caa gee caa acc 4 844 
Lys Thr Lys Glu Val Ala Lys Asp Lys Leu Val Ala Gin Ala Gin Thr 
1365 1370 1375 

cca gat ggc aca ttg get caa atg aat gtc aaa tea gtc att aac aaa 4892 
Pro Asp Gly Thr Leu Ala Gin Met Asn Val Lys Ser Val He Asn Lys 
1380 1385 1390 1395 

gaa caa gta aat gat gee aat aaa aag caa ggc ate aat gaa gac aac 4 940 
Glu Gin Val Asn Asp Ala Asn Lys Lys Gin Gly He Asn Glu Asp Asn 
1400 1405 1410 

gee ttt gtt aaa gga ctt gaa aaa gee get tct gat aac aaa acc aaa 4988 
Ala Phe Val Lys Gly Leu Glu Lys Ala Ala Ser Asp Asn Lys Thr Lys 
1415 1420 1425 

aac gec gca gta act gtg ggt gat tta aat gee gtt gec caa aca ccg 5036 
Asn Ala Ala Val Thr Val Gly Asp Leu Asn Ala Val Ala Gin Thr Pro 
1430 1435 1440 

ctg acc ttt gca ggg gat aca ggc aca acg get aaa aaa ctg ggc gag 5 084 
Leu Thr Phe Ala Gly Asp Thr Gly Thr Thr Ala Lys Lys Leu Gly Glu 
1445 1450 1455 

act ttg acc ate aaa ggt ggg caa aca gac acc aat aag eta acc gat 5132 
Thr Leu Thr He Lys Gly Gly Gin Thr Asp Thr Asn Lys Leu Thr Asp 
1460 1465 1470 1475 

aat aac ate ggt gtg gta gca ggt act gat ggc ttc act gtc aaa ctt 5180 
Asn Asn He Gly Val Val Ala Gly Thr Asp Gly Phe Thr Val Lys Leu 
1480 1485 1490 

gec aaa gac eta acc aat ctt aac age gtt aat gca ggt ggc acc aaa 5228 
Ala Lys Asp Leu Thr Asn Leu Asn Ser Val Asn Ala Gly Gly Thr Lys 
1495 1500 1505 

att gat gac aaa ggc gtg tct ttt gta gac tea age ggt caa gec aaa 52 76 
He Asp Asp Lys Gly Val Ser Phe Val Asp Ser Ser Gly Gin Ala Lys 
1510 1515 1520 

gca aac acc cct gtg eta agt gec aat ggg ctg gac ctg ggt ggc aag 5324 
Ala Asn Thr Pro Val Leu Ser Ala Asn Gly Leu Asp Leu Gly Gly Lys 
1525 1530 1535 

gtc ate agt aat gtg ggc aaa ggc aca aaa gat acc gac get gee aat 53 72 



Val He Ser Asn Val Gly Lys Gly Thr Lys Asp Thr Asp Ala Ala Asn 
1540 1545 1550 1555 



gta caa cag tta aac gaa gta cgc aac ttg ttg ggt ctt ggt aat get 542 0 
Val Gin Gin Leu Asn Glu Val Arg Asn Leu Leu Gly Leu Gly Asn Ala 
1560 1565 1570 

ggt aat gat aac get gac ggc aat cag gta aac att gec gac ate aaa 5468 
Gly Asn Asp Asn Ala Asp Gly Asn Gin Val Asn He Ala Asp He Lys 
1575 1580 1585 

aaa gac cca aat tea ggt tea tea tct aac cgc act gtc ate aaa gca 5516 
Lys Asp Pro Asn Ser Gly Ser Ser Ser Asn Arg Thr Val He Lys Ala 
1590 1595 1600 

ggc acg gta ctt ggc ggt aaa ggt aat aac gat ace gaa aaa ctt gee 5564 
Gly Thr Val Leu Gly Gly Lys Gly Asn Asn Asp Thr Glu Lys Leu Ala 
1605 1610 1615 

act ggt ggt ata caa gtg ggc gtg gat aaa gac ggc aac get aac ggc 5612 
Thr Gly Gly He Gin Val Gly Val Asp Lys Asp Gly Asn Ala Asn Gly 
1620 1625 1630 1635 

gat tta age aat gtt tgg gtc aaa ace caa aaa gat ggc age aaa aaa 5660 
Asp Leu Ser Asn Val Trp Val Lys Thr Gin Lys Asp Gly Ser Lys Lys 
1640 1645 1650 

gee ctg etc gee act tat aac gee gca ggt cag ace aac tat ttg acc 5708 
Ala Leu Leu Ala Thr Tyr Asn Ala Ala Gly Gin Thr Asn Tyr Leu Thr 
1655 166Q 1665 

aac aac ccc gca gaa gee att gac aga ata aat gaa caa ggt ate cgc 5756 
Asn Asn Pro Ala Glu Ala He Asp Arg He Asn Glu Gin Gly He Arg 
1670 1675 1680 



ttc ttc cat gtc aac gat ggc aat caa gag cct gtg gta caa ggg cgt 
Phe Phe His Val Asn Asp Gly Asn Gin Glu Pro Val Val Gin Gly Arg 
1685 1690 1695 



5804 



aac ggc att gac tea agt gec tea ggc aag cac tea gtg gcg ata ggt 5852 
Asn Gly He Asp Ser Ser Ala Ser Gly Lys His Ser Val Ala He Gly 
1700 1705 1710 1715 

ttc cag gec aag gca gat ggt gaa gec gee gtt gee ata ggc aga caa 5 900 
Phe Gin Ala Lys Ala Asp Gly Glu Ala Ala Val Ala He Gly Arg Gin 
1720 1725 1730 

acc caa gca ggc aac caa tec ate gee ate ggt gat aac gca caa gee 5948 
Thr Gin Ala Gly Asn Gin Ser He Ala He Gly Asp Asn Ala Gin Ala 
1735 1740 1745 

acg ggc gat caa tec ate gec ate ggt aca ggc aat gtg gta gca ggt 5996 
Thr Gly Asp Gin Ser He Ala He Gly Thr Gly Asn Val Val Ala Gly 
1750 1755 1760 

aag cac tct ggt gee ate ggc gac cca age act gtt aag get gat aac 6044 
Lys His Ser Gly Ala He Gly Asp Pro Ser Thr Val Lys Ala Asp Asn 
1765 1770 1775 



agt tac agt gtg ggt aat aac aac cag ttt acc gat gcc act caa acc 
Ser Tyr Ser Val Gly Asn Asn Asn Gin Phe Thr Asp Ala Thr Gin Thr 
1780 1785 1790 1795 



6092 



gat gtc ttt ggt gtg ggc aat aac ate acc gtg acc gaa agt aac teg 6140 
Asp Val Phe Gly Val Gly Asn Asn He Thr Val Thr Glu Ser Asn Ser 
1800 1805 1810 

gtt gcc tta ggt tea aac tct gcc ate agt gca ggc aca cac gca ggc 6188 
Val Ala Leu Gly Ser Asn Ser Ala He Ser Ala Gly Thr His Ala Gly 
1815 1820 1825 

aca caa gcc aaa aaa tct gac ggc aca gca ggt aca acc acc aca gca 6236 
Thr Gin Ala Lys Lys Ser Asp Gly Thr Ala Gly Thr Thr Thr Thr Ala 
1830 1835 1840 

ggt gca acc ggt acg gtt aaa ggc ttt get gga caa acg gcg gtt ggt 62 84 
Gly Ala Thr Gly Thr Val Lys Gly Phe Ala Gly Gin Thr Ala Val Gly 
1845 1850 1855 

gcg gtc tec gtg ggt gcc tea ggt get gaa cgc cgt ate caa aat gtg 6332 
Ala Val Ser Val Gly Ala Ser Gly Ala Glu Arg Arg He Gin Asn Val 
1860 1865 1870 1875 

gca gca ggt gag gtc agt gcc acc age acc gat gcg gtc aat ggt age 6380 
Ala Ala Gly Glu Val Ser Ala Thr Ser Thr Asp Ala Val Asn Gly Ser 
1880 1885 1890 

cag ttg tac aaa gcc acc caa age att gcc aac gca acc aat gag ctt 642 8 
Gin Leu Tyr Lys Ala Thr Gin Ser He Ala Asn Ala Thr Asn Glu Leu 
1895 1900 1905 

gac cat cgt ate cac caa aac gaa aat aag gcc aat gca ggg att tea 6476 
Asp His Arg He His Gin Asn Glu Asn Lys Ala Asn Ala Gly He Ser 
1910 1915 1920 

tea gcg atg gcg atg gcg tec atg cca caa gcc tac att cct ggc aga 6524 
Ser Ala Met Ala Met Ala Ser Met Pro Gin Ala Tyr He Pro Gly Arg 
1925 1930 1935 

tec atg gtt acc ggg ggt att gcc acc cac aac ggt caa ggt gcg gtg 6572 
Ser Met Val Thr Gly Gly He Ala Thr His Asn Gly Gin Gly Ala Val 
1940 1945 1950 1955 

gca gtg gga ctg teg aag ctg teg gat aat ggt caa tgg gta ttt aaa 662 0 
Ala Val Gly Leu Ser Lys Leu Ser Asp Asn Gly Gin Trp Val Phe Lys 
I960 1965 1970 

ate aat ggt tea gcc gat acc caa ggc cat gta ggg gcg gca gtt ggt 6668 
He Asn Gly Ser Ala Asp Thr Gin Gly His Val Gly Ala Ala Val Gly 
1975 1980 1985 

gca ggt ttt cac ttt taagccataa ategcaagat tttacttaaa aatcaatctc 6723 
Ala Gly Phe His Phe 
1990 

accatagttg tataaaacag catcagcatc agtcatatta ctgatgctga tgttttttat 6783 
cacttaaacc attttacege tcaagtgatt ctctttcacc atgaccaaat cgecattgat 6843 



cataggtaaa cttattgagt aaattttatc 
gccattgacc aaaaaatgac cgatttatcc 
gcaggtcgac 



aatgtagttg ttagatatgg ttaaaattgt 6903 
cgaaaatttc tgattatgat ccgttgacct 6963 

6973 



Figure 3. M. catarrhalis strain 4223 genomic 200kDa gene. 
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c atg aat cac ate tat aaa gtc ate ttt aac aaa gec aca ggc aca ttt 589 
Met Asn His He Tyr Lys Val He Phe Asn Lys Ala Thr Gly Thr Phe 
15 10 15 

atg gca gtg gca gag tac gec aaa tec cac age ac q ggg aaa ga t age 63 7 
Met Ala Val Ala Glu Tyr Ala Lys Ser His Ser Thr Gly Gly Gly Ser 
20 25 30 

tgt get aca ggg caa gtt ggc agt gta tgc act ctg age ttt gec cgt 685 
Cys Ala Thr Gly Gin Val Gly Ser Val Cys Thr Leu Ser Phe Ala Arg 
35 40 45 

att gec gcg etc get gtc etc gtg ate ggt gca acg etc agt ggc agt 733 
He Ala Ala Leu Ala Val Leu Val He Gly Ala Thr Leu Ser Gly Ser 
50 55 60 

get tat get caa aaa aaa gat ace aaa cat ate gca att ggt gaa caa 781 
Ala Tyr Ala Gin Lys Lys Asp Thr Lys His He Ala He Gly Glu Gin 
65 70 75 80 

aac cag cca aga cgc tea ggc act gec aag gcg gac ggt gat cga gee 82 9 
Asn Gin Pro Arg Arg Ser Gly Thr Ala Lys Ala Asp Gly Asp Arg Ala 
85 90 95 

att get att ggt gaa aat get aac gca cag ggc ggt caa gec ate gee 877 
He Ala He Gly Glu Asn Ala Asn Ala Gin Gly Gly Gin Ala He Ala 
100 105 HO 

ate ggt agt agt aat aaa act gtc aat gga age agt ttg gat aag ata 925 
He Gly Ser Ser Asn Lys Thr Val Asn Gly Ser Ser Leu Asp Lys He 
115 120 125 

ggt acc gat get acg ggt caa gag tec ate gec ate ggt ggt gat gta 973 
Gly Thr Asp Ala Thr Gly Gin Glu Ser He Ala He Gly Gly Asp Val 
130 135 140 

aag get agt ggt gat gec teg att gee ate ggt agt gat gac tta cat 1021 
Lys Ala Ser Gly Asp Ala Ser He Ala He Gly Ser Asp Asp Leu His 



145 



150 



155 



160 



ttg ctt gat cag cat ggt aat cct aaa cat ccg aaa ggt act ctg att 10 69 
Leu Leu Asp Gin His Gly Asn Pro Lys His Pro Lys Gly Thr Leu lie 
165 170 175 

aac gat ctt att aac ggc cat gca gta tta aaa gaa ata cga age tea 1117 
Asn Asp Leu He Asn Gly His Ala Val Leu Lys Glu He Arg Ser Ser 
180 185 190 

aag gat aat gat gta aaa tat aga cgc aca acc gca age gga cac gec 1165 
Lys Asp Asn Asp Val Lys Tyr Arg Arg Thr Thr Ala Ser Gly His Ala 
195 200 205 

agt act gca gtg gga gec atg tea tat gca cag ggt cat ttt tec aac 1213 
Ser Thr Ala Val Gly Ala Met Ser Tyr Ala Gin Gly His Phe Ser Asn 
210 215 220 

gec ttt ggt aca egg gca aca get aaa agt gec tat tec ttg gca gtg 1261 
Ala Phe Gly Thr Arg Ala Thr Ala Lys Ser Ala Tyr Ser Leu Ala Val 
225 230 235 240 

ggt ctt gee gec aca gec gag ggc caa tct aca ate get att ggt tct 13 09 
Gly Leu Ala Ala Thr Ala Glu Gly Gin Ser Thr He Ala He Gly Ser 
245 250 255 

gat gca aca tct age teg ttg gga gcg ata gee ctt ggt gca ggt act 1357 
Asp Ala Thr Ser Ser Ser Leu Gly Ala He Ala Leu Gly Ala Gly Thr 
260 265 270 

cgt get cag eta cag ggc agt att gee eta ggt caa ggt tct gtt gtc 1405 
Arg Ala Gin Leu Gin Gly Ser He Ala Leu Gly Gin Gly Ser Val Val 
275 280 285 

act cag agt gat aat aat tct aga ccg gee tat aca cca aat acc cag 1453 
Thr Gin Ser Asp Asn Asn Ser Arg Pro Ala Tyr Thr Pro Asn Thr Gin 
290 295 300 

gca eta gac ccc aag ttt caa gee acc aat aat acg aag gcg ggt cca 1501 
Ala Leu Asp Pro Lys Phe Gin Ala Thr Asn Asn Thr Lys Ala Gly Pro 
305 310 315 320 

ctt tec att ggt agt aac tct ate aaa cgt aaa ate ate aat gtc ggt 1549 
Leu Ser He Gly Ser Asn Ser He Lys Arg Lys He He Asn Val Gly 
325 330 335 

gca ggt gtt aat aaa acc gat gcg gtc aat gtg gca cag eta gaa gcg 15 97 
Ala Gly Val Asn Lys Thr Asp Ala Val Asn Val Ala Gin Leu Glu Ala 
340 345 350 

gtg gtg aag tgg get aag gag cgt aga att act ttt cag ggt gat gat 164 5 
Val Val Lys Trp Ala Lys Glu Arg Arg He Thr Phe Gin Gly Asp Asp 
355 360 365 

aac agt act gac gta aaa ata ggt ttg gat aat act tta act att aaa 1693 
Asn Ser Thr Asp Val Lys He Gly Leu Asp Asn Thr Leu Thr He Lys 
370 375 380 

ggt ggt gca gag acc aac gca tta acc gat aat aat ate ggt gtg gta 1741 



Gly Gly Ala Glu Thr Asn Ala Leu Thr Asp Asn Asn He Gly Val Val 
385 390 395 400 



aaa gag get gat aat agt ggt ctg aaa gtt aaa ctt get aaa act tta 1789 
Lys Glu Ala Asp Asn Ser Gly Leu Lys Val Lys Leu Ala Lys Thr Leu 
405 410 415 

aac aat ctt act gag gtg aat aca act aca tta aat gec aca acc aca 1837 
Asn Asn Leu Thr Glu Val Asn Thr Thr Thr Leu Asn Ala Thr Thr Thr 
420 425 430 

gtt aag gta ggt agt agt agt agt act aca get gaa tta ttg agt gat 18 85 
Val Lys Val Gly Ser Ser Ser Ser Thr Thr Ala Glu Leu Leu Ser Asp 
435 440 445 

agt tta acc ttt acc cag ccc aat aca ggc agt caa age aca age aaa 1933 
Ser Leu Thr Phe Thr Gin Pro Asn Thr Gly Ser Gin Ser Thr Ser Lys 
450 455 460 

acc gtc tat ggc gtt aat ggg gtg aag ttt act aat aat gca gaa aca 1981 
Thr Val Tyr Gly Val Asn Gly Val Lys Phe Thr Asn Asn Ala Glu Thr 
465 470 475 480 

aca gca gca ate ggc act act cgt att acc aga gat aaa att ggc ttt 2029 
Thr Ala Ala He Gly Thr Thr Arg He Thr Arg Asp Lys He Gly Phe 
485 490 495 

get cga gat ggt gat gtt gat gaa aaa caa gca cca tat ttg gat aaa 2 077 
Ala Arg Asp Gly Asp Val Asp Glu Lys Gin Ala Pro Tyr Leu Asp Lys 
500 505 510 

aaa caa ctt aaa gtg ggt agt gtt gca att acc ata gac aat ggc att 2125 
Lys Gin Leu Lys Val Gly Ser Val Ala He Thr He Asp Asn Gly He 
515 520 525 

gat gca ggt aat aaa aag ate agt aat ctt gee aaa ggt age agt get 2173 
Asp Ala Gly Asn Lys Lys He Ser Asn Leu Ala Lys Gly Ser Ser Ala 
530 535 540 

aac gat gcg gtt acc ate gaa cag etc aaa gee gee aag cct act tta 2221 
Asn Asp Ala Val Thr He Glu Gin Leu Lys Ala Ala Lys Pro Thr Leu 
545 550 555 560 

aac gca ggc get ggc ate agt gtc aca cct act gaa ata tea gtt gat 2269 
Asn Ala Gly Ala Gly He Ser Val Thr Pro Thr Glu He Ser Val Asp 
565 570 575 

get aag agt ggc aat gtt acc gec cca act tac aac att ggc gtg aaa 2317 
Ala Lys Ser Gly Asn Val Thr Ala Pro Thr Tyr Asn He Gly Val Lys 
580 585 590 

acc acc gag ctt aac agt gat ggc act agt gat aaa ttt agt gtt aag 2 3 65 
Thr Thr Glu Leu Asn Ser Asp Gly Thr Ser Asp Lys Phe Ser Val Lys 
595 600 605 

ggt agt ggt acg aac aat age tta gtt acc gec gaa cat ttg gca age 2413 
Gly Ser Gly Thr Asn Asn Ser Leu Val Thr Ala Glu His Leu Ala Ser 
610 615 620 



tat eta aat gaa gtc aat cga acg get gac agt get eta caa age ttt 24 61 

Tyr Leu Asn Glu Val Asn Arg Thr Ala Asp Ser Ala Leu Gin Ser Phe 
625 630 635 640 

acc gtt aaa gaa gaa gac gat gat gac gec aac get ate ace gtg get 250 9 

Thr Val Lys Glu Glu Asp Asp Asp Asp Ala Asn Ala He Thr Val Ala 
645 650 655 

aaa gat acg aca aaa aat gee ggc gca gtc age ate tta aaa etc aaa 2557 

Lys Asp Thr Thr Lys Asn Ala Gly Ala Val Ser He Leu Lys Leu Lys 

660 665 670 

ggt aaa aac ggt eta acg gtt get acc aaa aaa gat ggt acg gtt acc 2 6 05 

Gly Lys Asn Gly Leu Thr Val Ala Thr Lys Lys Asp Gly Thr Val Thr 

675 680 685 

ttt ggg ctt age caa gat age ggt ctg acc att ggc aaa age acc eta 2653 

Phe Gly Leu Ser Gin Asp Ser Gly Leu Thr He Gly Lys Ser Thr Leu 
690 695 700 

aac aac gat ggc ttg act gtt aaa gat acc aac gaa caa ate caa gtc 2 701 

Asn Asn Asp Gly Leu Thr Val Lys Asp Thr Asn Glu Gin He Gin Val 
705 710 715 720 

ggt get aat ggc att aaa ttt act aat gtg aat ggt agt aat cca ggt 2 749 

Gly Ala Asn Gly He Lys Phe Thr Asn Val Asn Gly Ser Asn Pro Gly 
725 730 735 

act ggc att gca aat acc get cgc att acc aga gat aaa att ggc ttt 2 7 97 

Thr Gly He Ala Asn Thr Ala Arg He Thr Arg Asp Lys He Gly Phe 

740 745 750 

get ggt tct gat ggt gca gtt gat aca aac aaa cct tat ctt gat caa 2845 

Ala Gly Ser Asp Gly Ala Val Asp Thr Asn Lys Pro Tyr Leu Asp Gin 

755 760 765 



gac aag eta caa gtt ggc aat gtt aag att acc aac act ggc att aac 
Asp Lys Leu Gin Val Gly Asn Val Lys He Thr Asn Thr Gly He Asn 
770 775 780 



gee gat caa agt age cgc aac ata gaa ctg ggc aat aca ate caa gac 
Ala Asp Gin Ser Ser Arg Asn He Glu Leu Gly Asn Thr He Gin Asp 
805 810 815 



2893 



gca ggt ggt aaa gee ate aca ggg ctg tec cca aca ctg cct age att 2 941 
Ala Gly Gly Lys Ala He Thr Gly Leu Ser Pro Thr Leu Pro Ser He 
785 790 795 800 



2989 



aaa gac aaa tec aac get gec age att aat gat ata tta aat aca ggc 3 03 7 

Lys Asp Lys Ser Asn Ala Ala Ser He Asn Asp He Leu Asn Thr Gly 
820 825 830 

ttt aac eta aaa aat aat aac aac ccc att gac ttt gtc tec act tat 3085 

Phe Asn Leu Lys Asn Asn Asn Asn Pro He Asp Phe Val Ser Thr Tyr 
835 840 845 

gac att gtt gac ttt gee aat ggc aat gee acc acc gee aca gta acc 3133 

Asp He Val Asp Phe Ala Asn Gly Asn Ala Thr Thr Ala Thr Val Thr 
850 855 860 



cat gat acc get aac aaa acc agt aaa gtg gta tat gat gtg aat gtg 3181 
His Asp Thr Ala Asn Lys Thr Ser Lys Val Val Tyr Asp Val Asn Val 
865 870 875 880 

gat gat aca acc att cat eta aca ggc act gat gac aat aaa aaa ctt 3229 
Asp Asp Thr Thr lie His Leu Thr Gly Thr Asp Asp Asn Lys Lys Leu 
885 890 895 

ggc gtc aaa acc acc aaa ctg aac aaa aca agt get aat ggt aat aca 32 77 
Gly Val Lys Thr Thr Lys Leu Asn Lys Thr Ser Ala Asn Gly Asn Thr 
900 905 910 

gca act aac ttt aat gtt aac tct agt gat gaa gat gec ctt gtt aac 3325 
Ala Thr Asn Phe Asn Val Asn Ser Ser Asp Glu Asp Ala Leu Val Asn 
915 920 925 

gec aaa gac ate gec gaa aat eta aac acc eta gee aag gaa att cac 3373 
Ala Lys Asp He Ala Glu Asn Leu Asn Thr Leu Ala Lys Glu He His 
930 935 940 

acc acc aaa ggc aca gca gac acc gee eta caa acc ttt acc gtt aaa 3421 
Thr Thr Lys Gly Thr Ala Asp Thr Ala Leu Gin Thr Phe Thr Val Lys 
945 950 955 960 

aag gta gat gaa aat aat aat get gat gac gec aac gee ate acc gtg 3469 
Lys Val Asp Glu Asn Asn Asn Ala Asp Asp Ala Asn Ala He Thr Val 
965 970 975 

ggt caa aag aac gca aat aat caa gtc aac acc eta aca etc aaa ggt 3 517 
Gly Gin Lys Asn Ala Asn Asn Gin Val Asn Thr Leu Thr Leu Lys Gly 
980 985 990 

gaa aac ggt ctt aat att aaa acc gac aaa aat ggt acg gtt acc ttt 3 565 
Glu Asn Gly Leu Asn He Lys Thr Asp Lys Asn Gly Thr Val Thr Phe 
995 1000 1005 

ggc att aac acc aca age ggt ctt aaa gec ggc aaa age acc eta aac 3 613 
Gly He Asn Thr Thr Ser Gly Leu Lys Ala Gly Lys Ser Thr Leu Asn 
1010 1015 1020 

gac ggt ggc ttg tct att aaa aac ccc act ggt age gaa caa ate caa 3 661 
Asp Gly Gly Leu Ser He Lys Asn Pro Thr Gly Ser Glu Gin He Gin 
1025 1030 1035 1040 

gtc ggt get gat ggc gtg aag ttt gee aag gtt aat aat aat ggt gtt 3 709 
Val Gly Ala Asp Gly Val Lys Phe Ala Lys Val Asn Asn Asn Gly Val 
1045 1050 1055 

gta ggt get ggc att gat ggc aca act cgc att acc aga gat gaa att 3 757 
Val Gly Ala Gly He Asp Gly Thr Thr Arg He Thr Arg Asp Glu He 
1060 1065 1070 

ggc ttt act ggg act aat ggc tea ctt gat aaa age aaa ccc cac eta 3 805 
Gly Phe Thr Gly Thr Asn Gly Ser Leu Asp Lys Ser Lys Pro His Leu 
1075 1080 1085 

age aaa gac ggc att aac gca ggt ggt aaa aag att acc aac att caa 3 853 
Ser Lys Asp Gly He Asn Ala Gly Gly Lys Lys He Thr Asn He Gin 



1090 



1095 



1100 



tea ggt gag att gec caa aac age cat gat get gtg aca ggc ggc aag 3901 
Ser Gly Glu He Ala Gin Asn Ser His Asp Ala Val Thr Gly Gly Lys 
1105 mo ins 1120 

att tat gat tta aaa acc gaa ctt gaa aac aaa ate age agt act gec 3 94 9 
He Tyr Asp Leu Lys Thr Glu Leu Glu Asn Lys He Ser Ser Thr Ala 
1125 1130 1135 

aaa aca gca caa aac tea tta cac gaa ttc tea gta gca gat gaa caa 3997 
Lys Thr Ala Gin Asn Ser Leu His Glu Phe Ser Val Ala Asp Glu Gin 
1140 1145 1150 

ggt aat aac ttt acg gtt agt aac cct tac tec agt tat gac acc tea 4045 
Gly Asn Asn Phe Thr Val Ser Asn Pro Tyr Ser Ser Tyr Asp Thr Ser 
1155 1160 1165 

aag acc tct gat gtc ate acc ttt gca ggt gaa aac ggc att acc acc 4093 
Lys Thr Ser Asp Val He Thr Phe Ala Gly Glu Asn Gly He Thr Thr 
1170 1175 1180 

aag gta aat aaa ggt gtg gtg cgt gtg ggc att gac caa acc aaa ggc 4141 
Lys Val Asn Lys Gly Val Val Arg Val Gly He Asp Gin Thr Lys Gly 
1185 1190 1195 1200 

tta acc acg cct aag ctg acc gtg ggt aat aat aat ggc aaa ggc att 4189 
Leu Thr Thr Pro Lys Leu Thr Val Gly Asn Asn Asn Gly Lys Gly He 
1205 1210 1215 

gtc att gac age caa aat ggt caa aat acc ate aca gga eta age aac 4237 
Val He Asp Ser Gin Asn Gly Gin Asn Thr He Thr Gly Leu Ser Asn 
1220 1225 1230 

act eta get aat gtt acc aat gat aaa ggt age gta cgc acc aca gaa 42 85 
Thr Leu Ala Asn Val Thr Asn Asp Lys Gly Ser Val Arg Thr Thr Glu 
1235 1240 1245 

cag ggc aat ata ate aaa gac gaa gac aaa acc cgt gec gec age att 4333 
Gin Gly Asn He He Lys Asp Glu Asp Lys Thr Arg Ala Ala Ser He 
1250 1255 1260 

gtt gat gtg eta age gca ggc ttt aac ttg caa ggc aat ggt gaa gcg 43 81 
Val Asp Val Leu Ser Ala Gly Phe Asn Leu Gin Gly Asn Gly Glu Ala 
1265 1270 1275 1280 

gtt gac ttt gtc tec act tat gac acc gtc aac ttt gec gat ggc aat 442 9 
Val Asp Phe Val Ser Thr Tyr Asp Thr Val Asn Phe Ala Asp Gly Asn 
1285 1290 1295 

gec acc acc get aag gtg acc tat gat gac aca age aaa acc agt aaa 4477 
Ala Thr Thr Ala Lys Val Thr Tyr Asp Asp Thr Ser Lys Thr Ser Lys 
1300 1305 1310 

gtg gtc tat gat gtc aat gtg gat gat aca acc att gaa gtt aaa gat 4525 
Val Val Tyr Asp Val Asn Val Asp Asp Thr Thr He Glu Val Lys Asp 
1315 1320 1325 

aaa aaa ctt ggc gta aaa acc acc aca ttg acc agt act ggc aca ggt 4573 



Lys Lys Leu Gly Val Lys Thr Thr Thr Leu Thr Ser Thr Gly Thr Gly 
1330 1335 1340 



get aat aaa ttt gec eta age aat caa get act ggc gat gcg ctt gtc 4621 
Ala Asn Lys Phe Ala Leu Ser Asn Gin Ala Thr Gly Asp Ala Leu Val 
1345 1350 1355 1360 

aag gee agt gat ate gtt get cat eta aac ace tta tct ggc gac ate 4669 
Lys Ala Ser Asp He Val Ala His Leu Asn Thr Leu Ser Gly Asp He 
1365 1370 1375 

caa act gee aaa ggg gca age caa gcg aac aac tea gca ggc tat gtg 4717 
Gin Thr Ala Lys Gly Ala Ser Gin Ala Asn Asn Ser Ala Gly Tyr Val 
1380 1385 1390 

gat get gat ggc aat aag gtc ate tat gac agt ace gat aac aag tac 4765 
Asp Ala Asp Gly Asn Lys Val He Tyr Asp Ser Thr Asp Asn Lys Tyr 
1395 1400 1405 

tat caa gee aaa aat gat ggc aca gtt gat aaa ace aaa gaa gtt gee 4813 
Tyr Gin Ala Lys Asn Asp Gly Thr Val Asp Lys Thr Lys Glu Val Ala 
1410 1415 1420 

aaa gac aaa ctg gtc gee caa gee caa acc cca gat ggc aca ttg get 4861 
Lys Asp Lys Leu Val Ala Gin Ala Gin Thr Pro Asp Gly Thr Leu Ala 
1425 1430 1435 1440 

caa atg aat gtc aaa tea gtc att aac aaa gaa caa gta aat gat gee 4909 
Gin Met Asn Val Lys Ser Val He Asn Lys Glu Gin Val Asn Asp Ala 
1445 1450 1455 

aat aaa aag caa ggc ate aat gaa gac aac gee ttt gtt aaa gga ctt 4957 
Asn Lys Lys Gin Gly He Asn Glu Asp Asn Ala Phe Val Lys Gly Leu 
1460 1465 1470 

gaa aaa gee get tct gat aac aaa acc aaa aac gee gca gta act gtg 5005 
Glu Lys Ala Ala Ser Asp Asn Lys Thr Lys Asn Ala Ala Val Thr Val 
1475 1480 1485 

ggt gat tta aat gee gtt gee caa aca ccg ctg acc ttt gca ggg gat 5053 
Gly Asp Leu Asn Ala Val Ala Gin Thr Pro Leu Thr Phe Ala Gly Asp 
1490 1495 1500 

aca ggc aca acg get aaa aaa ctg ggc gag act ttg acc ate aaa ggt 5101 
Thr Gly Thr Thr Ala Lys Lys Leu Gly Glu Thr Leu Thr He Lys Gly 
1505 1510 1515 1520 

ggg caa aca gac acc aat aag eta acc gat aat aac ate ggt gtg gta 514 9 
Gly Gin Thr Asp Thr Asn Lys Leu Thr Asp Asn Asn He Gly Val Val 
1525 1530 1535 

gca ggt act gat ggc ttc act gtc aaa ctt gee aaa gac eta acc aat 5197 
Ala Gly Thr Asp Gly Phe Thr Val Lys Leu Ala Lys Asp Leu Thr Asn 
1540 1545 1550 

ctt aac age gtt aat gca ggt ggc acc aaa att gat gac aaa ggc gtg 5245 
Leu Asn Ser Val Asn Ala Gly Gly Thr Lys He Asp Asp Lys Gly Val 
1555 1560 1565 



tct ttt gta gac tea age ggt caa gec aaa gca aac acc cct gtg eta 52 93 
Ser Phe Val Asp Ser Ser Gly Gin Ala Lys Ala Asn Thr Pro Val Leu 
1570 1575 1580 

agt gec aat ggg ctg gac ctg ggt ggc aag gtc ate agt aat gtg ggc 5341 
Ser Ala Asn Gly Leu Asp Leu Gly Gly Lys Val He Ser Asn Val Gly 
1585 1590 1595 1600 

aaa ggc aca aaa gat acc gac get gee aat gta caa cag tta aac gaa 5389 
Lys Gly Thr Lys Asp Thr Asp Ala Ala Asn Val Gin Gin Leu Asn Glu 
1605 1610 1615 

gta cgc aac ttg ttg ggt ctt ggt aat get ggt aat gat aac get gac 543 7 
Val Arg Asn Leu Leu Gly Leu Gly Asn Ala Gly Asn Asp Asn Ala Asp 
1620 1625 1630 

ggc aat cag gta aac att gec gac ate aaa aaa gac cca aat tea ggt 5485 
Gly Asn Gin Val Asn He Ala Asp He Lys Lys Asp Pro Asn Ser Gly 
1635 1640 1645 

tea tea tct aac cgc act gtc ate aaa gca ggc acg gta ctt ggc ggt 5533 
Ser Ser Ser Asn Arg Thr Val He Lys Ala Gly Thr Val Leu Gly Gly 
1650 1655 1660 

aaa ggt aat aac gat acc gaa aaa ctt gee act ggt ggt ata caa gtg 5581 
Lys Gly Asn Asn Asp Thr Glu Lys Leu Ala Thr Gly Gly He Gin Val 
1665 1670 1675 1680 

ggc gtg gat aaa gac ggc aac get aac ggc gat tta age aat gtt tgg 562 9 
Gly Val Asp Lys Asp Gly Asn Ala Asn Gly Asp Leu Ser Asn Val Trp 
1685 1690 1695 

gtc aaa acc caa aaa gat ggc age aaa aaa gee ctg etc gec act tat 5677 
Val Lys Thr Gin Lys Asp Gly Ser Lys Lys Ala Leu Leu Ala Thr Tyr 
1700 1705 1710 

aac gec gca ggt cag acc aac tat ttg acc aac aac ccc gca gaa gec 5725 
Asn Ala Ala Gly Gin Thr Asn Tyr Leu Thr Asn Asn Pro Ala Glu Ala 
1715 1720 1725 

att gac aga ata aat gaa caa ggt ate cgc ttc ttc cat gtc aac gat 5773 
He Asp Arg He Asn Glu Gin Gly He Arg Phe Phe His Val Asn Asp 
1730 1735 1740 

ggc aat caa gag cct gtg gta caa ggg cgt aac ggc att gac tea agt 5821 
Gly Asn Gin Glu Pro Val Val Gin Gly Arg Asn Gly He Asp Ser Ser 
1745 1750 1755 1760 

gec tea ggc aag cac tea gtg gcg ata ggt ttc cag gee aag gca gat 5869 
Ala Ser Gly Lys His Ser Val Ala He Gly Phe Gin Ala Lys Ala Asp 
1765 1770 1775 

ggt gaa gec gec gtt gee ata ggc aga caa acc caa gca ggc aac caa 5917 
Gly Glu Ala Ala Val Ala He Gly Arg Gin Thr Gin Ala Gly Asn Gin 
1780 1785 1790 

tec ate gec ate ggt gat aac gca caa gee acg ggc gat caa tec ate 5965 
Ser He Ala He Gly Asp Asn Ala Gin Ala Thr Gly Asp Gin Ser He 
1795 1800 1805 



gcc ate ggt aca ggc aat gtg gta gca ggt aag cac tct ggt gec ate 6013 
Ala He Gly Thr Gly Asn Val Val Ala Gly Lys His Ser Gly Ala He 
1810 1815 1820 

ggc gac cca age act gtt aag get gat aac agt tac agt gtg ggt aat 6061 
Gly Asp Pro Ser Thr Val Lys Ala Asp Asn Ser Tyr Ser Val Gly Asn 
1825 1830 1835 1840 

aac aac cag ttt ace gat gcc act caa ace gat gtc ttt ggt gtg ggc 6109 
Asn Asn Gin Phe Thr Asp Ala Thr Gin Thr Asp Val Phe Gly Val Gly 
1845 1850 1855 

aat aac ate ace gtg acc gaa agt aac teg gtt gcc tta ggt tea aac 6157 
Asn Asn He Thr Val Thr Glu Ser Asn Ser Val Ala Leu Gly Ser Asn 
I860 1865 1870 

tct gcc ate agt gca ggc aca cac gca ggc aca caa gcc aaa aaa tct 6205 
Ser Ala He Ser Ala Gly Thr His Ala Gly Thr Gin Ala Lys Lys Ser 
1875 1880 1885 

gac ggc aca gca ggt aca acc acc aca gca ggt gca acc ggt acg gtt 6253 
Asp Gly Thr Ala Gly Thr Thr Thr Thr Ala Gly Ala Thr Gly Thr Val 
1890 1895 1900 

aaa ggc ttt get gga caa acg gcg gtt ggt gcg gtc tec gtg ggt gcc 63 01 
Lys Gly Phe Ala Gly Gin Thr Ala Val Gly Ala Val Ser Val Gly Ala 
1905 1910 1915 1920 

tea ggt get gaa cgc cgt ate caa aat gtg gca gca ggt gag gtc agt 6349 
Ser Gly Ala Glu Arg Arg He Gin Asn Val Ala Ala Gly Glu Val Ser 
1925 1930 1935 

gcc acc age acc gat gcg gtc aat ggt age cag ttg tac aaa gcc acc 63 97 
Ala Thr Ser Thr Asp Ala Val Asn Gly Ser Gin Leu Tyr Lys Ala Thr 
1940 1945 1950 

caa age att gcc aac gca acc aat gag ctt gac cat cgt ate cac caa 6445 
Gin Ser He Ala Asn Ala Thr Asn Glu Leu Asp His Arg He His Gin 
1955 I960 1965 

aac gaa aat aag gcc aat gca ggg att tea tea gcg atg gcg atg gcg 64 93 
Asn Glu Asn Lys Ala Asn Ala Gly He Ser Ser Ala Met Ala Met Ala 
1970 1975 1980 

tec atg cca caa gcc tac att cct ggc aga tec atg gtt acc ggg ggt 6541 
Ser Met Pro Gin Ala Tyr He Pro Gly Arg Ser Met Val Thr Gly Gly 
1985 1990 1995 2000 

att gcc acc cac aac ggt caa ggt gcg gtg gca gtg gga ctg teg aag 6589 
He Ala Thr His Asn Gly Gin Gly Ala Val Ala Val Gly Leu Ser Lys 
2005 2010 2015 

ctg teg gat aat ggt caa tgg gta ttt aaa ate aat ggt tea gcc gat 663 7 
Leu Ser Asp Asn Gly Gin Trp Val Phe Lys He Asn Gly Ser Ala Asp 
2020 2025 2030 

acc caa ggc cat gta ggg gcg gca gtt ggt gca ggt ttt cac ttt 6682 
Thr Gin Gly His Val Gly Ala Ala Val Gly Ala Gly Phe His Phe 



2035 2040 
taagccataa atcgcaagat tttacttaaa 
catcagcatc agtcatatta ctgatgctga 
tcaagtgatt ctctttcacc atgaccaaat 
aaattttatc aatgtagttg ttagatatgg 
cgatttatcc cgaaaatttc tgattatgat 



2045 

aatcaatctc accatagttg tataaaacag 6742 
tgttttttat cacttaaacc attttaccgc 6802 
cgccattgat cataggtaaa cttattgagt 6862 
ttaaaattgt gccattgacc aaaaaatgac 6922 
ccgttgacct gcaggtcgac 6972 



Figure 4. M. catarrhalis strain Q8 200kDa gene 



ATG aat cac ate tat aaa gtc ate ttt aac aaa gec aca ggc aca ttt 48 
Met Asn His lie Tyr Lys Val lie Phe Asn Lys Ala Thr Gly Thr Phe 
15 10 15 

atg gec gtg gcg gaa tat gec aaa tec cac agt ac q aaa aaa aa t age 96 
Met Ala Val Ala Glu Tyr Ala Lys Ser His Ser Thr Gly Gly Gly Ser 
20 25 30 

tgt get aca ggg caa gtt ggc agt gta cgc act eta age ttt gec cgt 144 
Cys Ala Thr Gly Gin Val Gly Ser Val Arg Thr Leu Ser Phe Ala Arg 
35 40 45 

att gee gcg etc get gtc etc gtg ate ggt gcg acg etc aat ggc agt 192 
He Ala Ala Leu Ala Val Leu Val He Gly Ala Thr Leu Asn Gly Ser 
50 55 60 

get tat get caa caa att act ace aag ate gaa att ggt caa aca aac 24 0 
Ala Tyr Ala Gin Gin He Thr Thr Lys He Glu He Gly Gin Thr Asn 
65 70 75 80 

aag ata aac aac acg ctg aaa ggc gat gec eta gcg aca ggt gaa gca 288 
Lys He Asn Asn Thr Leu Lys Gly Asp Ala Leu Ala Thr Gly Glu Ala 
85 90 95 

tec att get ttt ggt agt ctt tct aag gca caa ggc tct caa get att 336 
Ser He Ala Phe Gly Ser Leu Ser Lys Ala Gin Gly Ser Gin Ala He 
100 105 110 

get ate ggt agt gtc aaa cca gat cct aat aat ggt agt aat ggt aat 3 84 
Ala He Gly Ser Val Lys Pro Asp Pro Asn Asn Gly Ser Asn Gly Asn 
115 120 125 

gta ggt tec cac gee aaa ggt aac gag tec ate gee ate ggt ggt gat 432 
Val Gly Ser His Ala Lys Gly Asn Glu Ser He Ala He Gly Gly Asp 
130 135 140 

gta ttg get gag ggt gat gee teg att gec ate ggt agt gat gac tta 480 
Val Leu Ala Glu Gly Asp Ala Ser He Ala He Gly Ser Asp Asp Leu 
145 150 155 160 

tat ttg cct aag aat ctt gat ctg aag aat gaa ttt cac aaa ctt att 52 8 
Tyr Leu Pro Lys Asn Leu Asp Leu Lys Asn Glu Phe His Lys Leu He 
165 170 175 

cat ggc cat gaa ata tta aaa aaa ata caa ace tea ace gat ggt aaa 57 6 
His Gly His Glu He Leu Lys Lys He Gin Thr Ser Thr Asp Gly Lys 
180 185 190 

ate aaa tat cga cgc aca aga gca caa ggg cac gee agt act gca gtg 624 
He Lys Tyr Arg Arg Thr Arg Ala Gin Gly His Ala Ser Thr Ala Val 
195 200 205 

gga gee atg tea tat gca cag ggt cat ttt tec aac gee ttt ggt aca 672 
Gly Ala Met Ser Tyr Ala Gin Gly His Phe Ser Asn Ala Phe Gly Thr 
210 215 220 



tac gca aca get gaa get gec tat tec ttg gca gta ggt ctt gee gee 720 
Tyr Ala Thr Ala Glu Ala Ala Tyr Ser Leu Ala Val Gly Leu Ala Ala 
225 230 235 240 

caa gec aca aaa caa tct tea ate get gtt ggt tec aat gca aaa get 768 
Gin Ala Thr Lys Gin Ser Ser lie Ala Val Gly Ser Asn Ala Lys Ala 
245 250 255 

aac gcg ttt gca gcg aca gee att ggt gga aat act gta gtt aat ttg 816 
Asn Ala Phe Ala Ala Thr Ala He Gly Gly Asn Thr Val Val Asn Leu 
260 265 270 

ggt cga ggc gtt gee eta ggt ttt ggt tct cag ate ctt gat agg gat 864 
Gly Arg Gly Val Ala Leu Gly Phe Gly Ser Gin He Leu Asp Arg Asp 
275 280 285 

aat aat aca gat gee agt gec tat gta cca eta ggt aaa acg tta gca 912 
Asn Asn Thr Asp Ala Ser Ala Tyr Val Pro Leu Gly Lys Thr Leu Ala 
290 295 300 

gac cag tat aaa gee acc cgc cag ggt gat tct acg gat ata ttt tec 960 
Asp Gin Tyr Lys Ala Thr Arg Gin Gly Asp Ser Thr Asp He Phe Ser 
305 310 315 320 

att ggt aat agt aat aat aat aat age agt ate agg cgt aaa ate ate 1008 
He Gly Asn Ser Asn Asn Asn Asn Ser Ser He Arg Arg Lys He He 
325 330 335 

aat gtc ggt gcg ggt tct egg gat acc gat gcg gtc aat gtg gca cag 1056 
Asn Val Gly Ala Gly Ser Arg Asp Thr Asp Ala Val Asn Val Ala Gin 
340 345 350 

ctt aaa ttg gtg gag gaa ctg get aat cgt aaa att act ttt aag ggt 1104 
Leu Lys Leu Val Glu Glu Leu Ala Asn Arg Lys He Thr Phe Lys Gly 
355 360 365 

gat ggt gac aat aat age aat age gta gaa aga ggt ttg ggc aat act 1152 
Asp Gly Asp Asn Asn Ser Asn Ser Val Glu Arg Gly Leu Gly Asn Thr 
370 375 380 

tta act att aaa ggt gat gca cag acc aac gca tta acc gaa get aac 12 0 0 
Leu Thr He Lys Gly Asp Ala Gin Thr Asn Ala Leu Thr Glu Ala Asn 
385 390 395 400 

ate ggt gtg gta aca gat ggc aat ggt ctg aaa gtt aaa ctt get aaa 124 8 
He Gly Val Val Thr Asp Gly Asn Gly Leu Lys Val Lys Leu Ala Lys 
405 410 415 

gag ctg act gga ttg acc agt gtc tec get acc aac aaa ate acc gtt 12 96 
Glu Leu Thr Gly Leu Thr Ser Val Ser Ala Thr Asn Lys He Thr Val 
420 425 430 

agt aat acc aac aac aac aac gee gag eta caa age ggt ggt ttg acc 1344 
Ser Asn Thr Asn Asn Asn Asn Ala Glu Leu Gin Ser Gly Gly Leu Thr 
435 440 445 

ttt age cca ata aca ggt aca aaa aca gat aaa acc gtc tac age att 13 92 
Phe Ser Pro He Thr Gly Thr Lys Thr Asp Lys Thr Val Tyr Ser He 
450 455 460 



gat gga ttg aag ttt act aat gat agt aat agt at a gca act aaa ggt 144 0 
Asp Gly Leu Lys Phe Thr Asn Asp Ser Asn Ser lie Ala Thr Lys Gly 
465 470 475 480 

act act cgt att acc aaa aag aaa att ggt ttt get ggt act aat gat 1488 
Thr Thr Arg He Thr Lys Lys Lys He Gly Phe Ala Gly Thr Asn Asp 
485 490 495 

gga gtt gat gaa age aaa cct tat ctt gac aac gaa aag eta aaa gtt 1536 
Gly Val Asp Glu Ser Lys Pro Tyr Leu Asp Asn Glu Lys Leu Lys Val 
500 505 510 

ggc aac age acc eta aac agt ggt age ttg act gtt aat aac acc act 1584 
Gly Asn Ser Thr Leu Asn Ser Gly Ser Leu Thr Val Asn Asn Thr Thr 
515 520 525 

ggt aat aaa caa ate caa gtc ggt get aat ggc att aaa ttt gee aca 1632 
Gly Asn Lys Gin He Gin Val Gly Ala Asn Gly He Lys Phe Ala Thr 
530 535 540 

gtc get aat aat gtt gca aat acc tea gca aca gtc ggc act get cgt 168 0 
Val Ala Asn Asn Val Ala Asn Thr Ser Ala Thr Val Gly Thr Ala Arg 
545 550 555 560 

att acc gaa gag aaa att ggt ttt get ggt act aat gat gga gtt gat 172 8 
He Thr Glu Glu Lys He Gly Phe Ala Gly Thr Asn Asp Gly Val Asp 
565 570 575 

gaa caa gca cca tat ttg gat aaa gaa cga ctt aaa gtg ggt cgt gtt 1776 
Glu Gin Ala Pro Tyr Leu Asp Lys Glu Arg Leu Lys Val Gly Arg Val 
580 585 590 

gaa att acc aca gat agt ggt att aat get ggt aat cac aag att acc 1824 
Glu He Thr Thr Asp Ser Gly He Asn Ala Gly Asn His Lys He Thr 
595 600 605 

gga ctt act aat ggt ata gca aat acc gat gcg gtt acc ate aaa cag 1872 
Gly Leu Thr Asn Gly He Ala Asn Thr Asp Ala Val Thr He Lys Gin 
610 615 620 

etc aaa gac gee aag cct act tta aac gca ggc gat ggc ate agt att 192 0 
Leu Lys Asp Ala Lys Pro Thr Leu Asn Ala Gly Asp Gly He Ser He 
625 630 635 640 

aat agt aat aac ggg gat eta gtt gat agt agt ggc aat att acc acc 1968 
Asn Ser Asn Asn Gly Asp Leu Val Asp Ser Ser Gly Asn He Thr Thr 
645 650 655 

cca act tat aac att age gtg aaa acc act aag ctt aac agt aat ggc 2 016 
Pro Thr Tyr Asn He Ser Val Lys Thr Thr Lys Leu Asn Ser Asn Gly 
660 665 670 

acc agt ggt aat aat aaa ttt agt gtt agt aat get cat gat aac aat 2 064 
Thr Ser Gly Asn Asn Lys Phe Ser Val Ser Asn Ala His Asp Asn Asn 
675 680 685 

age tta gtt acc gee aaa gat ttg gca gac tat eta aat aaa gtc aat 2112 
Ser Leu Val Thr Ala Lys Asp Leu Ala Asp Tyr Leu Asn Lys Val Asn 



690 



695 



700 



gaa acg get gac agt get eta cca age ttt aaa gtc caa aac ggt gat 2160 
Glu Thr Ala Asp Ser Ala Leu Pro Ser Phe Lys Val Gin Asn Gly Asp 
705 710 715 720 

aat age aac aac gec ate ace gtg ggt aaa gat aca aac ggc aag ace 2208 
Asn Ser Asn Asn Ala He Thr Val Gly Lys Asp Thr Asn Gly Lys Thr 
725 730 735 

ttc aac acc tta aaa etc aaa ggt gaa aac ggt gtt aat att acg acc 2256 
Phe Asn Thr Leu Lys Leu Lys Gly Glu Asn Gly Val Asn He Thr Thr 
740 745 750 

aat aga gee aca ggt aca gtt acc ttt ggc att gac caa agt aat ggt 23 04 
Asn Arg Ala Thr Gly Thr Val Thr Phe Gly He Asp Gin Ser Asn Gly 
755 760 765 

etc acc acg cct aag ctg acc gtg ggt age gat aca aat ggt aat cga 2352 
Leu Thr Thr Pro Lys Leu Thr Val Gly Ser Asp Thr Asn Gly Asn Arg 
770 775 780 

ttg gtt att gag caa gtc cct age get gac ggt aac age acc aaa aac 24 0 0 
Leu Val He Glu Gin Val Pro Ser Ala Asp Gly Asn Ser Thr Lys Asn 
785 790 795 800 

ate att aaa gga ttg tec cca aca ctg cct age att gee agt cca agt 2448 
He He Lys Gly Leu Ser Pro Thr Leu Pro Ser He Ala Ser Pro Ser 
805 810 815 

ggc cgc aac ata gca ctg ggc aat aca ate gaa gaa aaa gac aaa tec 2496 
Gly Arg Asn He Ala Leu Gly Asn Thr He Glu Glu Lys Asp Lys Ser 
820 825 830 

aac get gee age att gat gat gtg eta aat gca ggc ttt aac eta aaa 2544 
Asn Ala Ala Ser He Asp Asp Val Leu Asn Ala Gly Phe Asn Leu Lys 
835 840 845 

aat aat ggc aaa gac aaa gac ttt gtc tec act tat gac act gtt gac 2592 
Asn Asn Gly Lys Asp Lys Asp Phe Val Ser Thr Tyr Asp Thr Val Asp 
850 855 860 

ttt ate gat ggc aat gee acc acc gee aca gta act tat gat gaa gee 2 64 0 
Phe He Asp Gly Asn Ala Thr Thr Ala Thr Val Thr Tyr Asp Glu Ala 
865 870 875 880 

aat caa acc agt aaa gtg gcg tat gat gtg aat gtg gat gag aaa acc 2 688 
Asn Gin Thr Ser Lys Val Ala Tyr Asp Val Asn Val Asp Glu Lys Thr 
885 890 895 

att gaa ctg aca ggc gat aat ggc aag aaa caa ctt ggc gtc aaa acc 2 736 
He Glu Leu Thr Gly Asp Asn Gly Lys Lys Gin Leu Gly Val Lys Thr 
900 905 910 

ate aaa ctg acc gaa aca agt act aat ggt aat gca act aca ttt agt 2784 
He Lys Leu Thr Glu Thr Ser Thr Asn Gly Asn Ala Thr Thr Phe Ser 
915 920 925 

acc gac gat gac cat gee ctt gtt aaa gee agt gat ate gee ggc aat 2832 



Thr Asp Asp Asp His Ala Leu Val Lys Ala Ser Asp He Ala Gly Asn 
930 935 940 



eta aac acc eta gec gag gaa att cac acc acc aaa ggc aca gca aac 2880 
Leu Asn Thr Leu Ala Glu Glu He His Thr Thr Lys Gly Thr Ala Asn 
945 950 955 960 

acc gec eta caa acc ttt acc gtt aaa aag gta gat gaa aat gat aag 2 92 8 
Thr Ala Leu Gin Thr Phe Thr Val Lys Lys Val Asp Glu Asn Asp Lys 
965 970 975 

get gat gac acc aac gee ate acc gtg ggt aaa gat ggc aca agt ggt 2976 
Ala Asp Asp Thr Asn Ala He Thr Val Gly Lys Asp Gly Thr Ser Gly 
980 985 990 

aaa gtc aac acc tta aaa etc aaa ggt aaa aac ggt ctt gat att aaa 3 024 
Lys Val Asn Thr Leu Lys Leu Lys Gly Lys Asn Gly Leu Asp He Lys 
995 1000 1005 

acc gac aaa gat ggt acg gtt acc ttt ggc att aac acc caa age ggt 3072 
Thr Asp Lys Asp Gly Thr Val Thr Phe Gly He Asn Thr Gin Ser Gly 
1010 1015 1020 

ctt aaa gee ggc gac age acc act eta aac aac aat ggc ttg tct att 312 0 
Leu Lys Ala Gly Asp Ser Thr Thr Leu Asn Asn Asn Gly Leu Ser He 
1025 1030 1035 1040 

aaa aac acc get agt aac gaa caa ate caa gtc ggt get gat ggc gtg 3168 
Lys Asn Thr Ala Ser Asn Glu Gin He Gin Val Gly Ala Asp Gly Val 
1045 1050 1055 

aag ttt gee atg gtt aat aat ggt gtt gta ggt get ggc att gat ggc 3216 
Lys Phe Ala Met Val Asn Asn Gly Val Val Gly Ala Gly He Asp Gly 
1060 1065 1070 

aca act cgc att acc aga gat gaa att ggc ttt act ggg act aat ggc 32 64 
Thr Thr Arg He Thr Arg Asp Glu He Gly Phe Thr Gly Thr Asn Gly 
1075 1080 1085 

tea ctt gat aaa age aaa ccc cac eta age aaa gac ggc att aac gca 3 312 
Ser Leu Asp Lys Ser Lys Pro His Leu Ser Lys Asp Gly He Asn Ala 
1090 1095 1100 

ggt ggt aaa aag att acc aac att caa tea ggt gag att gec aaa aac 33 60 
Gly Gly Lys Lys He Thr Asn He Gin Ser Gly Glu He Ala Lys Asn 
1105 1110 1115 1120 

age cat gat get gtg aca ggc ggc aag att tat gat tta aaa acc gaa 34 08 
Ser His Asp Ala Val Thr Gly Gly Lys He Tyr Asp Leu Lys Thr Glu 
1125 1130 1135 

ctt gaa aat aaa ate age agt act gec aaa aca gca caa aac tea tta 3456 
Leu Glu Asn Lys He Ser Ser Thr Ala Lys Thr Ala Gin Asn Ser Leu 
1140 1145 1150 

cac gaa ttc tea gta gca gat gaa caa ggt aat aac ttt acg gtt agt 3504 
His Glu Phe Ser Val Ala Asp Glu Gin Gly Asn Asn Phe Thr Val Ser 
1155 1160 1165 



aac cct tac tec agt tat gac acc tea aag acc tct gat gtc ate acc 3552 
Asn Pro Tyr Ser Ser Tyr Asp Thr Ser Lys Thr Ser Asp Val He Thr 
1170 1175 H80 

ttt gca ggt gaa aac ggc att acc acc aag gta aat aaa ggt gtg gtg 3 60 0 
Phe Ala Gly Glu Asn Gly He Thr Thr Lys Val Asn Lys Gly Val Val 
1185 1190 1195 1200 

cgt gtg ggc att gac caa acc aaa ggc tta acc acg cct aag ctg acc 3 64 8 
Arg Val Gly He Asp Gin Thr Lys Gly Leu Thr Thr Pro Lys Leu Thr 
1205 1210 1215 

gtg ggt aat aat aat ggc aaa ggc att gtc att aac age caa aat ggt 3696 
Val Gly Asn Asn Asn Gly Lys Gly He Val He Asn Ser Gin Asn Gly 
1220 1225 1230 

caa aat acc ate aca gga eta age aac act eta get aat gtt acc aat 3 744 
Gin Asn Thr He Thr Gly Leu Ser Asn Thr Leu Ala Asn Val Thr Asn 
1235 1240 1245 

gat aaa ggt age gta cgc acc aca gaa cag ggc aat ata ate aaa gac 3 7 92 
Asp Lys Gly Ser Val Arg Thr Thr Glu Gin Gly Asn He He Lys Asp 
1250 1255 1260 

gaa gac aaa acc cgt gec gee age att gtt gat gtg eta age gca ggc 3 84 0 
Glu Asp Lys Thr Arg Ala Ala Ser He Val Asp Val Leu Ser Ala Gly 
1265 1270 1275 1280 

ttt aac ttg caa ggc aat ggt gaa gcg gtt gac ttt gtc tec act tat 3888 
Phe Asn Leu Gin Gly Asn Gly Glu Ala Val Asp Phe Val Ser Thr Tyr 
1285 1290 1295 

gac acc gtc aac ttt gee aat ggc aat acc acc acc get aag gtg acc 3 93 6 
Asp Thr Val Asn Phe Ala Asn Gly Asn Thr Thr Thr Ala Lys Val Thr 
1300 1305 1310 

tat gat gac aca age aaa acc agt aaa gtg gtc tat gat gtc aat gtg 3 984 
Tyr Asp Asp Thr Ser Lys Thr Ser Lys Val Val Tyr Asp Val Asn Val 
1315 1320 1325 

gat gat aca acc att gaa gtt aaa gat aaa aaa ctt ggc gta aaa acc 4032 
Asp Asp Thr Thr He Glu Val Lys Asp Lys Lys Leu Gly Val Lys Thr 
1330 1335 1340 

acc aca ttg acc agt act ggc aca ggt get aat aaa ttt gee eta age 4080 
Thr Thr Leu Thr Ser Thr Gly Thr Gly Ala Asn Lys Phe Ala Leu Ser 
1345 1350 1355 1360 

aat caa get act ggc gat gcg ctt gtc aag gee agt gat ate gtt get 412 8 
Asn Gin Ala Thr Gly Asp Ala Leu Val Lys Ala Ser Asp He Val Ala 
1365 1370 1375 

cat eta aac acc tta tct ggc gac ate caa act gec aaa ggg gca age 4176 
His Leu Asn Thr Leu Ser Gly Asp He Gin Thr Ala Lys Gly Ala Ser 
1380 1385 1390 

caa gcg aac aac tea gca ggc tat gtg gat get gat ggc aat aag gtc 4224 
Gin Ala Asn Asn Ser Ala Gly Tyr Val Asp Ala Asp Gly Asn Lys Val 
1395 1400 1405 



ate tat gac agt acc gat aac aag tac tat caa gec aaa aat gat ggc 4272 
He Tyr Asp Ser Thr Asp Asn Lys Tyr Tyr Gin Ala Lys Asn Asp Gly 
1410 1415 1420 

aca gtt gat aaa acc aaa gaa gtt gec aaa gac aaa ctg gtc gec caa 432 0 
Thr Val Asp Lys Thr Lys Glu Val Ala Lys Asp Lys Leu Val Ala Gin 
1425 1430 1435 1440 

gec caa acc cca gat ggc aca ttg get caa atg aat gtc aaa tea gtc 4368 
Ala Gin Thr Pro Asp Gly Thr Leu Ala Gin Met Asn Val Lys Ser Val 
1445 1450 1455 

att aac aaa gaa caa gta aat gat gec aat aaa aag caa ggc ate aat 4416 
He Asn Lys Glu Gin Val Asn Asp Ala Asn Lys Lys Gin Gly He Asn 
1460 1465 1470 

gaa gac aac gec ttt gtt aaa gga ctt gaa aaa gec get tct gat aac 4464 
Glu Asp Asn Ala Phe Val Lys Gly Leu Glu Lys Ala Ala Ser Asp Asn 
1475 1480 1485 

aaa acc aaa aac gec gca gta act gtg ggt gat tta aat gec gtt gee 4512 
Lys Thr Lys Asn Ala Ala Val Thr Val Gly Asp Leu Asn Ala Val Ala 
1490 1495 1500 

caa aca ccg ctg acc ttt gca ggg gat aca ggc aca acg get aaa aaa 4560 
Gin Thr Pro Leu Thr Phe Ala Gly Asp Thr Gly Thr Thr Ala Lys Lys 
1505 1510 1515 1520 

ct 9 ggc gag act ttg acc ate aaa ggt ggg caa aca gac acc aat aag 4608 
Leu Gly Glu Thr Leu Thr He Lys Gly Gly Gin Thr Asp Thr Asn Lys 
1525 1530 1535 

eta acc gat aat aac ate ggt gtg gta gca ggt act gat ggc ttc act 4 656 
Leu Thr Asp Asn Asn He Gly Val Val Ala Gly Thr Asp Gly Phe Thr 
1540 1545 1550 

gtc aaa ctt gec aaa gac eta acc aat ctt aac age gtt aat gca ggt 4704 
Val Lys Leu Ala Lys Asp Leu Thr Asn Leu Asn Ser Val Asn Ala Gly 
1555 1560 1565 

ggc acc aaa att gat gaa aaa ggc ate tct ttt gta gac gca aac ggt 47 52 
Gly Thr Lys He Asp Glu Lys Gly He Ser Phe Val Asp Ala Asn Gly 
1570 1575 1580 

caa gee aaa gca aac acc cct gtg eta agt gec aat ggg ctg gac ctg 4800 
Gin Ala Lys Ala Asn Thr Pro Val Leu Ser Ala Asn Gly Leu Asp Leu 
1585 1590 1595 1600 

g9t ggc aag gtc ate agt aat gtg ggc aaa ggc aca aaa gat acc gac 4848 
Gly Gly Lys Val He Ser Asn Val Gly Lys Gly Thr Lys Asp Thr Asp 
1605 1610 1615 

get gec aat gta caa cag tta aac gaa gta cgc aac ttg ttg ggt ctt 4 8 96 
Ala Ala Asn Val Gin Gin Leu Asn Glu Val Arg Asn Leu Leu Gly Leu 
1620 1625 1630 

ggt aat gat aac get gac ggc aat cag gta aac att gec gac ate aaa 4 944 
Gly Asn Asp Asn Ala Asp Gly Asn Gin Val Asn He Ala Asp He Lys 



1635 



1640 



1645 



aaa gac cca aat tea ggt tea tea tct aac cgc act gtc ate aaa gca 4992 
Lys Asp Pro Asn Ser Gly Ser Ser Ser Asn Arg Thr Val He Lys Ala 
1650 1655 1660 

ggc acg gta ctt ggc ggt aaa ggt aat aac gat acc gaa aaa ctt gec 5040 
Gly Thr Val Leu Gly Gly Lys Gly Asn Asn Asp Thr Glu Lys Leu Ala 
1665 1670 1675 1680 

act ggt ggt gta caa gtg ggc gtg gat aaa gac ggc aac get aac ggc 50 8 8 
Thr Gly Gly Val Gin Val Gly Val Asp Lys Asp Gly Asn Ala Asn Gly 
1685 1690 1695 

gat tta age aat gtt tgg gtc aaa acc caa aaa gat ggc age aaa aaa 513 6 
Asp Leu Ser Asn Val Trp Val Lys Thr Gin Lys Asp Gly Ser Lys Lys 
1700 1705 1710 

gee ctg etc gee act tat aac gee gca ggt cag acc aac tat gtg acc 5184 
Ala Leu Leu Ala Thr Tyr Asn Ala Ala Gly Gin Thr Asn Tyr Val Thr 
1715 1720 1725 

aac aac ccc gca gaa gec att gac aga ata aat gaa caa ggt ate cgc 5232 
Asn Asn Pro Ala Glu Ala He Asp Arg He Asn Glu Gin Gly He Arg 
1730 1735 1740 

ttc ttc cat gtc aac gat ggc aat caa gag cct gtg gta caa ggg cgt 5280 
Phe Phe His Val Asn Asp Gly Asn Gin Glu Pro Val Val Gin Gly Arg 
1745 1750 1755 1760 

aac ggc att gac tea agt gec tea ggc aag cac tea gtg gcg ata ggt 532 8 
Asn Gly He Asp Ser Ser Ala Ser Gly Lys His Ser Val Ala He Gly 
1765 1770 1775 

ttc cag gee aag gca gat ggt gaa gec gee gtt gec ata ggc aga caa 53 76 
Phe Gin Ala Lys Ala Asp Gly Glu Ala Ala Val Ala He Gly Arg Gin 
1780 1785 1790 

acc caa gca ggc aac caa tec ate gec ate ggt gat aac gca caa gee 5424 
Thr Gin Ala Gly Asn Gin Ser He Ala He Gly Asp Asn Ala Gin Ala 
1795 1800 1805 

acg ggc gat caa tec ate gee ate ggt aca ggc aat gtg gta gca ggt 5472 
Thr Gly Asp Gin Ser He Ala He Gly Thr Gly Asn Val Val Ala Gly 
1810 1815 1820 

aag cac tct ggt gec ate ggc gac cca age act gtt aag get gat aac 552 0 
Lys His Ser Gly Ala He Gly Asp Pro Ser Thr Val Lys Ala Asp Asn 
1825 1830 1835 1840 

agt tac agt gtg ggt aat aac aac cag ttt acc gat gee act caa acc 5568 
Ser Tyr Ser Val Gly Asn Asn Asn Gin Phe Thr Asp Ala Thr Gin Thr 
1845 1850 1855 

gat gtc ttt ggt gtg ggc aat aac ate acc gtg acc gaa agt aac teg 5616 
Asp Val Phe Gly Val Gly Asn Asn He Thr Val Thr Glu Ser Asn Ser 
I860 1865 1870 

gtt gee tta ggt tea aac tct gec ate agt gca ggc aca cac gca ggc 5664 



Val Ala Leu Gly Ser Asn Ser Ala He Ser Ala Gly Thr His Ala Gly 
1875 1880 1885 



aca caa gcc aaa aaa tct gac ggc aca gca ggt aca acc acc aca gca 5712 
Thr Gin Ala Lys Lys Ser Asp Gly Thr Ala Gly Thr Thr Thr Thr Ala 
1890 1895 1900 

ggt gcc aca ggt acg gtt aaa ggc ttt get gga caa acg gcg gtt ggt 5760 
Gly Ala Thr Gly Thr Val Lys Gly Phe Ala Gly Gin Thr Ala Val Gly 
1905 1910 1915 1920 

gcg gtc tec gtg ggt gcc tea ggt get gaa cgc cgt ate caa aat gtg 5808 
Ala Val Ser Val Gly Ala Ser Gly Ala Glu Arg Arg He Gin Asn Val 
1925 1930 1935 

gca gca ggt gag gtc agt gcc acc age acc gat gcg gtc aat ggt age 5 856 
Ala Ala Gly Glu Val Ser Ala Thr Ser Thr Asp Ala Val Asn Gly Ser 
1940 1945 1950 

cag ttg tac aaa gcc acc caa age att gcc aac gca acc aat gag ctt 5904 
Gin Leu Tyr Lys Ala Thr Gin Ser He Ala Asn Ala Thr Asn Glu Leu 
1955 I960 1965 

gac cat cgt ate cac caa aac gaa aat aaa gcc aat gca ggg att tea 5952 
Asp His Arg He His Gin Asn Glu Asn Lys Ala Asn Ala Gly He Ser 
1970 1975 1980 

tea gcg atg gcg atg gcg tec atg cca caa gcc tac att cct ggc aga 6000 
Ser Ala Met Ala Met Ala Ser Met Pro Gin Ala Tyr lie Pro Gly Arg 
1985 1990 1995 2000 

tec atg gtt acc ggg ggt att gcc acc cac aac ggt caa ggt gcg gtg 604 8 
Ser Met Val Thr Gly Gly He Ala Thr His Asn Gly Gin Gly Ala Val 
2005 2010 2015 

gca gtg gga ctg teg aag ctg teg gat aat ggt caa tgg gta ttt aaa 6096 
Ala Val Gly Leu Ser Lys Leu Ser Asp Asn Gly Gin Trp Val Phe Lys 
2020 2025 2030 

ate aat ggt tea gcc gat acc caa ggc cat gta ggg gcg gca gtt ggt 6144 
He Asn Gly Ser Ala Asp Thr Gin Gly His Val Gly Ala Ala Val Gly 
2035 2040 2045 

gca ggt ttt cac ttt 6159 
Ala Gly Phe His Phe 
2050 



Figure 5. Moraxella catarrhalis lesl 200kDa 



ATG aat cac ate tat aaa gtc ate ttt aac aaa gee aca ggc aca ttt 4 8 
Met Asn His lie Tyr Lys Val lie Phe Asn Lys Ala Thr Gly Thr Phe 
15 10 15 

atg gec gtg gca gag tgc gec aaa tec cac age gga ggg agt age agt 96 
Met Ala Val Ala Glu Cys Ala Lys Ser His Ser Gly Gly Ser Ser Ser 
20 25 30 

agt acc gca gga cag gtg ggc age tct cct gtc ate cgc ctg act cgt 144 
Ser Thr Ala Gly Gin Val Gly Ser Ser Pro Val lie Arg Leu Thr Arg 
35 40 45 

gtt gec acg etc get ate eta gtg ate ggt gcg acg etc aat ggc agt 192 
Val Ala Thr Leu Ala lie Leu Val He Gly Ala Thr Leu Asn Gly Ser 
50 55 60 

get tat get caa aat aat age aag ate gca ttt ggt acc aca ggc aac 240 
Ala Tyr Ala Gin Asn Asn Ser Lys He Ala Phe Gly Thr Thr Gly Asn 
65 70 75 80 

aat gac aat gec teg get age aat gaa gca tec att get att ggt agt 288 
Asn Asp Asn Ala Ser Ala Ser Asn Glu Ala Ser He Ala He Gly Ser 
85 90 95 

ctt get aag gca cat gec aat caa get att get ate ggt ggt age aaa 336 
Leu Ala Lys Ala His Ala Asn Gin Ala He Ala He Gly Gly Ser Lys 
100 105 110 

cca gat cct cgt aat caa gcg get aat cag aag gca ggt tec cac gec 3 84 
Pro Asp Pro Arg Asn Gin Ala Ala Asn Gin Lys Ala Gly Ser His Ala 
115 120 125 

aaa ggt aaa gag tec ate gee ate ggt ggt gat gta ctg get gag ggt 432 
Lys Gly Lys Glu Ser He Ala He Gly Gly Asp Val Leu Ala Glu Gly 
130 135 140 

gat gec teg att gec att ggt agt gat gac tta tat ttg gat agg aat 4 80 
Asp Ala Ser He Ala He Gly Ser Asp Asp Leu Tyr Leu Asp Arg Asn 
145 150 155 160 

age act aac tct aaa tat cca aat ggt ctt ctt age act ctt att caa 52 8 
Ser Thr Asn Ser Lys Tyr Pro Asn Gly Leu Leu Ser Thr Leu He Gin 
165 170 175 

aac cat aca gta tta cgc caa ata cga gac tea aat ggt tct cag aaa 576 
Asn His Thr Val Leu Arg Gin He Arg Asp Ser Asn Gly Ser Gin Lys 
180 185 190 

tat aga cgc aca gca gca gaa gga cac gec agt act gca gtg gga gee 624 
Tyr Arg Arg Thr Ala Ala Glu Gly His Ala Ser Thr Ala Val Gly Ala 
195 200 205 



atg gca tat gca aag ggt cat ttt gee aac gee ttt ggt aca egg tea 672 
Met Ala Tyr Ala Lys Gly His Phe Ala Asn Ala Phe Gly Thr Arg Ser 



210 



215 



220 



aca get gaa ggc aac tat tec ttg gca gta ggt ctt acc gee aaa gec 72 0 
Thr Ala Glu Gly Asn Tyr Ser Leu Ala Val Gly Leu Thr Ala Lys Ala 
225 230 235 240 

gaa aaa gga tat aca ate get att ggt tct aat gca caa get ate aat 768 
Glu Lys Gly Tyr Thr He Ala He Gly Ser Asn Ala Gin Ala He Asn 
245 250 255 

tat gga gca eta gec ctt ggt gca gat act cga gtt gat ttg gat tac 816 
Tyr Gly Ala Leu Ala Leu Gly Ala Asp Thr Arg Val Asp Leu Asp Tyr 
260 265 270 

ggt att gec eta ggt tat ggt tct cag ate ctt aat aat aat aat aat 864 
Gly He Ala Leu Gly Tyr Gly Ser Gin He Leu Asn Asn Asn Asn Asn 
275 280 285 

aat aat aat aaa gec tat gta cca gaa ggt aat ggg tea aac ata aaa 912 
Asn Asn Asn Lys Ala Tyr Val Pro Glu Gly Asn Gly Ser Asn He Lys 
290 295 300 

teg tct aaa gec acc ggc aat ggt tta ttt tec att ggt agt age act 960 
Ser Ser Lys Ala Thr Gly Asn Gly Leu Phe Ser He Gly Ser Ser Thr 
305 310 315 320 

ate aag cgt aaa ate ate aat gtc ggt gca ggt tat gag gat acc gat 10 08 
He Lys Arg Lys He He Asn Val Gly Ala Gly Tyr Glu Asp Thr Asp 
325 330 335 

gcg gtc aat gtg gca cag eta aaa gcg gtg gag aat ctg get aag cgt 105 6 
Ala Val Asn Val Ala Gin Leu Lys Ala Val Glu Asn Leu Ala Lys Arg 
340 345 350 

caa att act ttt aag ggt gat gat aac ggt act ggc gtt aag aaa aaa 1104 
Gin He Thr Phe Lys Gly Asp Asp Asn Gly Thr Gly Val Lys Lys Lys 
355 360 365 

ctg ggc gag act tta acc att aaa ggt ggt gag acc caa gcg gac aag 1152 
Leu Gly Glu Thr Leu Thr He Lys Gly Gly Glu Thr Gin Ala Asp Lys 
370 375 380 

eta acc gat aat aat aac att ggt gtg gta aca gat aat aat act ggt 1200 
Leu Thr Asp Asn Asn Asn He Gly Val Val Thr Asp Asn Asn Thr Gly 
385 390 395 400 

ctg aaa gtt aaa ctt get aaa aac eta age ggt ctt gaa aca gtt age 1248 
Leu Lys Val Lys Leu Ala Lys Asn Leu Ser Gly Leu Glu Thr Val Ser 
405 410 415 

acc aaa aac eta acc gec age gag aaa gtt acg gta ggt agt ggt aat 12 96 
Thr Lys Asn Leu Thr Ala Ser Glu Lys Val Thr Val Gly Ser Gly Asn 
420 425 430 

aac acc get gag eta caa age ggt ggt tta acc ttt acc cca aca aca 1344 
Asn Thr Ala Glu Leu Gin Ser Gly Gly Leu Thr Phe Thr Pro Thr Thr 
435 440 445 



aat gca age aca gac aaa acc gtc tat ggc act gat ggg ctt aag ttt 13 92 
Asn Ala Ser Thr Asp Lys Thr Val Tyr Gly Thr Asp Gly Leu Lys Phe 
450 455 460 

act gat aat tct aat acg gca ctt gaa gat act act cgt ate acc aaa 1440 
Thr Asp Asn Ser Asn Thr Ala Leu Glu Asp Thr Thr Arg lie Thr Lys 
465 470 475 480 

gat aaa att ggt ttt age aat aaa get ggt aca gtt gat gaa aac aaa 14 88 
Asp Lys lie Gly Phe Ser Asn Lys Ala Gly Thr Val Asp Glu Asn Lys 
485 490 495 

cct tat ctt gat aaa gac aag eta aaa gtt ggc aac age acc eta aac 1536 
Pro Tyr Leu Asp Lys Asp Lys Leu Lys Val Gly Asn Ser Thr Leu Asn 
500 505 510 

aac ggt ggc ttg act gtt aat aac acc att ggt ggt age aat aaa caa 1584 
Asn Gly Gly Leu Thr Val Asn Asn Thr lie Gly Gly Ser Asn Lys Gin 
515 520 525 

ate caa gtc ggt get gat ggc att aaa ttt gee gat gtg aat gtt aat 1632 
lie Gin Val Gly Ala Asp Gly lie Lys Phe Ala Asp Val Asn Val Asn 
530 535 540 

gta tea aat gee gca aaa ttc ggc act act cgt att acc gaa gag gaa 1680 
Val Ser Asn Ala Ala Lys Phe Gly Thr Thr Arg He Thr Glu Glu Glu 
545 550 555 560 

att ggc ttt get gat get gat ggt aaa gtt gat aaa aag tea cca tat 172 8 
He Gly Phe Ala Asp Ala Asp Gly Lys Val Asp Lys Lys Ser Pro Tyr 
565 570 575 

ttg gat aaa aaa caa ctt caa gtg ggt ggt gtt aaa att acc aaa gac 177 6 
Leu Asp Lys Lys Gin Leu Gin Val Gly Gly Val Lys He Thr Lys Asp 
580 585 590 

agt ggc att aat gca ggt gat caa aag ate agt aat gtt aaa gat gca 1824 
Ser Gly He Asn Ala Gly Asp Gin Lys He Ser Asn Val Lys Asp Ala 
595 600 605 

acg gac gat acc gat gca gtc act tat aaa cag ctt aaa caa gtc caa 18 72 
Thr Asp Asp Thr Asp Ala Val Thr Tyr Lys Gin Leu Lys Gin Val Gin 
610 615 620 

caa gac gee gac ggt gee eta caa age ttc tct att cgt gat gaa aaa 192 0 
Gin Asp Ala Asp Gly Ala Leu Gin Ser Phe Ser He Arg Asp Glu Lys 
625 630 635 640 

ggt cag gaa ttt acg att agt aac ttg tat tct aat ggt aat acc cca 1968 
Gly Gin Glu Phe Thr He Ser Asn Leu Tyr Ser Asn Gly Asn Thr Pro 
645 650 655 

aat acc ttt gag acc ate acc ttt gca ggt gaa aac ggc ate agt ate 2016 
Asn Thr Phe Glu Thr He Thr Phe Ala Gly Glu Asn Gly He Ser He 
660 665 670 



age aat gac ata gec aaa ggt aaa gtc aaa gtt ggt att gac cca ate 2064 
Ser Asn Asp lie Ala Lys Gly Lys Val Lys Val Gly lie Asp Pro lie 
675 680 685 

aat ggt etc acc acg cct aag ctg acc gtg ggt age gat aaa gat ggt 2112 
Asn Gly Leu Thr Thr Pro Lys Leu Thr Val Gly Ser Asp Lys Asp Gly 
690 695 700 

aaa act caa ttg gtt att gag caa gtg get age ggt aac gac acc aaa 2160 
Lys Thr Gin Leu Val He Glu Gin Val Ala Ser Gly Asn Asp Thr Lys 
705 710 715 720 

aac ate att aga gga ttg tec cca aca ctg cct age att acc aat gca 2208 
Asn He He Arg Gly Leu Ser Pro Thr Leu Pro Ser He Thr Asn Ala 
725 730 735 

ggt 99 c gta cgc acc aca gaa cag ggc aat aca ate acc age gac gaa 225 6 
Gly Gly Val Arg Thr Thr Glu Gin Gly Asn Thr He Thr Ser Asp Glu 
740 745 750 

gac aaa tec aaa gee gee agt ate ggt gat ata tta aat aca ggc ttt 2 3 04 
Asp Lys Ser Lys Ala Ala Ser He Gly Asp He Leu Asn Thr Gly Phe 
755 760 765 

aac eta aaa aat aat age aac tec gtt ggc ttt gtc tec act tat aac 2352 
Asn Leu Lys Asn Asn Ser Asn Ser Val Gly Phe Val Ser Thr Tyr Asn 
770 775 780 

act gtt gac ttt ate gat ggc aat gee acc acc get aag gta act tac 2400 
Thr Val Asp Phe He Asp Gly Asn Ala Thr Thr Ala Lys Val Thr Tyr 
785 790 795 800 

gat gaa acc aat caa acc agt aaa gta act tat gat gtc aat gtg gat 2448 
Asp Glu Thr Asn Gin Thr Ser Lys Val Thr Tyr Asp Val Asn Val Asp 
805 810 815 

gag aaa acc att gaa etc aca ggc gat aat ggc aag aca aac aaa att 24 96 
Glu Lys Thr He Glu Leu Thr Gly Asp Asn Gly Lys Thr Asn Lys He 
820 825 830 

ggc gtc aaa acc acc aca ctg acc aca aca aat get aat ggt aaa gca 2544 
Gly Val Lys Thr Thr Thr Leu Thr Thr Thr Asn Ala Asn Gly Lys Ala 
835 840 845 

acc aac ttt agt acc acc gat aac gat gee ctt gtt aac gec aaa gac 2592 
Thr Asn Phe Ser Thr Thr Asp Asn Asp Ala Leu Val Asn Ala Lys Asp 
850 855 860 

ate gee gaa aat eta aac acc eta gec aag gaa att cac acc acc aaa 2640 
He Ala Glu Asn Leu Asn Thr Leu Ala Lys Glu He His Thr Thr Lys 
865 870 875 880 

ggc aca gca gac acc gee eta caa acc ttt aaa gtc aaa aaa gac ggt 2688 
Gly Thr Ala Asp Thr Ala Leu Gin Thr Phe Lys Val Lys Lys Asp Gly 
885 890 895 

gca act gat gac gaa acc ate acc gtg ggt aaa gat ggt aca caa aac 2 73 6 



Ala Thr Asp Asp Glu Thr He Thr Val Gly Lys Asp Gly Thr Gin Asn 
900 905 910 



ggc aag acc gtc aac act eta aaa etc aaa ggt gaa aac ggt eta acg 2784 
Gly Lys Thr Val Asn Thr Leu Lys Leu Lys Gly Glu Asn Gly Leu Thr 
915 920 925 

gtt get acc aat aaa gat ggt acg gtt acc ttt ggc att aac acc caa 2 832 
Val Ala Thr Asn Lys Asp Gly Thr Val Thr Phe Gly He Asn Thr Gin 
930 935 940 

a g c ggt ctt aaa gee ggc gac age acc act eta aac aaa gat ggc ttg 2 880 
Ser Gly Leu Lys Ala Gly Asp Ser Thr Thr Leu Asn Lys Asp Gly Leu 
945 950 955 960 

tct att aaa aac ccc get agt aac gaa caa ate caa gtc ggt get gat 2 92 8 
Ser He Lys Asn Pro Ala Ser Asn Glu Gin He Gin Val Gly Ala Asp 
965 970 975 

ggc gtg aag ttt gee aag gtt gat aag ggt aat tea age act ggc att 2976 
Gly Val Lys Phe Ala Lys Val Asp Lys Gly Asn Ser Ser Thr Gly He 
980 985 990 

gat ggc aca age cgt ate acc aaa gat caa att ggc ttt act ggg get 3 024 
Asp Gly Thr Ser Arg He Thr Lys Asp Gin He Gly Phe Thr Gly Ala 
995 1000 1005 

aat ggc tea ctt gat acc acc aaa ccc cac eta acc aaa gac aag ctt 3072 
Asn Gly Ser Leu Asp Thr Thr Lys Pro His Leu Thr Lys Asp Lys Leu 
1010 1015 1020 

aaa gtg ggt gaa gtt gaa att acc aac act ggc att aac gca ggt ggt 312 0 
Lys Val Gly Glu Val Glu He Thr Asn Thr Gly He Asn Ala Gly Gly 
1025 1030 1035 1040 

aaa aag att acc aac att caa tea ggt gat att acc caa aac age aat 3168 
Lys Lys He Thr Asn He Gin Ser Gly Asp He Thr Gin Asn Ser Asn 
1045 1050 1055 

gat get gtg aca ggc ggt egg gtt tat gat tta aaa acc gaa ctt gaa 3216 
Asp Ala Val Thr Gly Gly Arg Val Tyr Asp Leu Lys Thr Glu Leu Glu 
1060 1065 1070 

age aaa ate aac agt get get aaa aca gca caa aac tea tta cac gaa 3264 
Ser Lys He Asn Ser Ala Ala Lys Thr Ala Gin Asn Ser Leu His Glu 
1075 1080 1085 

ttc tea gta gca gat gaa caa ggt aat cac ttt acg gtt agt aac cct 3 312 
Phe Ser Val Ala Asp Glu Gin Gly Asn His Phe Thr Val Ser Asn Pro 
1090 1095 1100 

tac tec agt tat gac acc tea aag acc tct gat gtc ate acc ttt gca 3360 
Tyr Ser Ser Tyr Asp Thr Ser Lys Thr Ser Asp Val He Thr Phe Ala 
1105 1110 1115 H20 

ggt gaa aac ggc att acc acc aag gta aat aaa ggt gtg gtg cgt gtg 3408 
Gly Glu Asn Gly He Thr Thr Lys Val Asn Lys Gly Val Val Arg Val 



1125 



1130 



1135 



ggc att gac caa acc aaa ggc tta acc acg cct aag ctg acc gtg ggt 3456 
Gly lie Asp Gin Thr Lys Gly Leu Thr Thr Pro Lys Leu Thr Val Gly 
1140 1145 1150 

aat aat aat ggc aaa ggc att gtc att gac agt aaa gat ggt caa aat 3504 
Asn Asn Asn Gly Lys Gly He Val He Asp Ser Lys Asp Gly Gin Asn 
1155 1160 1165 

acc ate aca gga eta age aac act eta get aat gtt acc aat gat ggt 3 552 
Thr He Thr Gly Leu Ser Asn Thr Leu Ala Asn Val Thr Asn Asp Gly 
1170 1175 1180 

gca gga cac gca eta age caa ggg ctt gec aat gac acc gac aaa acc 3600 
Ala Gly His Ala Leu Ser Gin Gly Leu Ala Asn Asp Thr Asp Lys Thr 
1185 1190 1195 1200 

cgt gec gec age att ggt gat gtg eta aac gca ggc ttt aac ttg caa 3648 
Arg Ala Ala Ser He Gly Asp Val Leu Asn Ala Gly Phe Asn Leu Gin 
1205 1210 1215 

ggc aat ggt gaa gcg gtt gac ttt gtc tec act tat gac act gtt gac 3 696 
Gly Asn Gly Glu Ala Val Asp Phe Val Ser Thr Tyr Asp Thr Val Asp 
1220 1225 1230 

ttt ate gat ggc aat gec acc acc get aag gtg acc tat gat gac aca 3 744 
Phe He Asp Gly Asn Ala Thr Thr Ala Lys Val Thr Tyr Asp Asp Thr 
1235 1240 1245 

age aaa acc agt aaa gtg gtc tat gat gtc aat gtg gat aat aaa acc 3 792 
Ser Lys Thr Ser Lys Val Val Tyr Asp Val Asn Val Asp Asn Lys Thr 
1250 1255 1260 

att gaa gtg aca agt gat aaa aaa ctt ggc gtc aaa acc acc aca ctg 3 84 0 
He Glu Val Thr Ser Asp Lys Lys Leu Gly Val Lys Thr Thr Thr Leu 
1265 1270 1275 1280 

acc aaa aca agt get aat ggt aat gca acc aaa ttt agt gec gee gat 3888 
Thr Lys Thr Ser Ala Asn Gly Asn Ala Thr Lys Phe Ser Ala Ala Asp 
1285 1290 1295 

ggc gat gee ctt gtt aaa gee agt gat ate gec acc cat eta aat acc 3 93 6 
Gly Asp Ala Leu Val Lys Ala Ser Asp He Ala Thr His Leu Asn Thr 
1300 1305 1310 

ttg get ggc gac ate caa acc gee aaa ggg gca age caa gca age age 3 984 
Leu Ala Gly Asp He Gin Thr Ala Lys Gly Ala Ser Gin Ala Ser Ser 
1315 1320 1325 

tea gca age tat gtg gat get gat ggc aac aag gtc ate tat gac agt 4 032 
Ser Ala Ser Tyr Val Asp Ala Asp Gly Asn Lys Val He Tyr Asp Ser 
1330 1335 1340 

acc gat aag aag tac tat caa gtc aat gac aag ggt caa gtg gac aaa 4 08 0 
Thr Asp Lys Lys Tyr Tyr Gin Val Asn Asp Lys Gly Gin Val Asp Lys 
1345 1350 1355 1360 



aac aaa gaa gtt gcc aaa gac aaa ctg gtc gcc caa gcc caa acc cca 412 8 
Asn Lys Glu Val Ala Lys Asp Lys Leu Val Ala Gin Ala Gin Thr Pro 
1365 1370 1375 

gat ggc aca ttg get caa atg aat gtc aaa tea gtc att aac aaa gag 4176 
Asp Gly Thr Leu Ala Gin Met Asn Val Lys Ser Val lie Asn Lys Glu 
1380 1385 1390 

caa gta aat gat gcc aat aaa aag caa ggc ate aat gaa gac aac gcc 4224 
Gin Val Asn Asp Ala Asn Lys Lys Gin Gly lie Asn Glu Asp Asn Ala 
1395 1400 1405 

ttt ate aaa ggg ctt gaa aac gcc gcc aaa gac acc aaa acc aaa aac 4272 
Phe lie Lys Gly Leu Glu Asn Ala Ala Lys Asp Thr Lys Thr Lys Asn 
1410 1415 1420 

gcc gca gta act gtg ggt gat tta aat gcc gtt gcc caa aca ccg ctg 4320 
Ala Ala Val Thr Val Gly Asp Leu Asn Ala Val Ala Gin Thr Pro Leu 
1425 1430 1435 1440 

acc ttt gca ggg gat aca ggc aca acg get aaa aaa ctg ggc gag act 43 68 
Thr Phe Ala Gly Asp Thr Gly Thr Thr Ala Lys Lys Leu Gly Glu Thr 
1445 1450 1455 

ttg acc ate aaa ggt ggg caa aca gac acc aat aag eta acc gat aat 4416 
Leu Thr lie Lys Gly Gly Gin Thr Asp Thr Asn Lys Leu Thr Asp Asn 
1460 1465 1470 

aac ate ggt gtg gta gca ggt act gat ggc ttc act gtc aaa ctt gcc 4464 
Asn lie Gly Val Val Ala Gly Thr Asp Gly Phe Thr Val Lys Leu Ala 
1475 1480 1485 

aaa gac eta acc aat ctt aac age gtt aat gca ggt ggc acc aga att 4512 
Lys Asp Leu Thr Asn Leu Asn Ser Val Asn Ala Gly Gly Thr Arg lie 
1490 1495 1500 

gat gaa aaa ggc ate tct ttt gta gac gca aac ggt caa gcc aaa gca 4560 
Asp Glu Lys Gly lie Ser Phe Val Asp Ala Asn Gly Gin Ala Lys Ala 
1505 1510 1515 1520 

aac acc cct gtg eta agt gcc aat ggg ctg gac ctg ggt ggc aaa cgc 4608 
Asn Thr Pro Val Leu Ser Ala Asn Gly Leu Asp Leu Gly Gly Lys Arg 
1525 1530 1535 

ate agt aac ate ggt gca get gtt gat gat aac gat gcg gtg aac ttt 4656 
lie Ser Asn lie Gly Ala Ala Val Asp Asp Asn Asp Ala Val Asn Phe 
1540 1545 1550 

aag cag ttt aat gaa gtt gcc aaa acg gtc aac aac eta aac aac caa 47 04 
Lys Gin Phe Asn Glu Val Ala Lys Thr Val Asn Asn Leu Asn Asn Gin 
1555 1560 1565 

agt aac tea ggt gcg tea tta ccc ttt gtg gta acc gat gcc aat ggc 4752 
Ser Asn Ser Gly Ala Ser Leu Pro Phe Val Val Thr Asp Ala Asn Gly 
1570 1575 1580 



aag ccc ate aat ggc acc gat ggc aag ccc caa aaa gec ate aag ggc 480 0 
Lys Pro He Asn Gly Thr Asp Gly Lys Pro Gin Lys Ala He Lys Gly 
1585 1590 1595 1600 

gec gat ggt aaa tac tat cac gec aac gec aac ggc gta cct gtg gac 4 84 8 
Ala Asp Gly Lys Tyr Tyr His Ala Asn Ala Asn Gly Val Pro Val Asp 
1605 1610 1615 

aaa gat ggc aag ccc ate acc gat gcg gac aaa ctt gec aat ctg gca 4896 
Lys Asp Gly Lys Pro He Thr Asp Ala Asp Lys Leu Ala Asn Leu Ala 
1620 1625 1630 

get cat ggc aaa ccc ctt gat gca ggt cat caa gtg gtg gca age eta 4944 
Ala His Gly Lys Pro Leu Asp Ala Gly His Gin Val Val Ala Ser Leu 
1635 1640 1645 

ggc ggc aac tea gat gec ate acc eta acc aac ate aag tec act ttg 4 992 
Gly Gly Asn Ser Asp Ala He Thr Leu Thr Asn He Lys Ser Thr Leu 
1650 1655 1660 

cca caa att gac aca cca aac aca ggt aat gec aat gca ggg caa gec 5040 
Pro Gin He Asp Thr Pro Asn Thr Gly Asn Ala Asn Ala Gly Gin Ala 
1665 1670 1675 1680 

caa agt ctg ccc age eta tea gca gca cag caa agt aat get gee agt 5088 
Gin Ser Leu Pro Ser Leu Ser Ala Ala Gin Gin Ser Asn Ala Ala Ser 
1685 1690 1695 

gtc aaa gat gtg eta aat gta ggc ttt aac ttg cag acc aat cac aat 5136 
Val Lys Asp Val Leu Asn Val Gly Phe Asn Leu Gin Thr Asn His Asn 
1700 1705 1710 

caa gtg gac ttt gtc aaa gee tat gat acc gtc aac ttt gtc aat ggt 5184 
Gin Val Asp Phe Val Lys Ala Tyr Asp Thr Val Asn Phe Val Asn Gly 
1715 1720 1725 

aca ggt gee gac ate aca age gtg cgt agt get gat ggc acg atg agt 523 2 
Thr Gly Ala Asp He Thr Ser Val Arg Ser Ala Asp Gly Thr Met Ser 
1730 1735 1740 

aac ate acc gtc aac acc gee tta gca gcg acc gat gat gat ggc aat 52 8 0 
Asn He Thr Val Asn Thr Ala Leu Ala Ala Thr Asp Asp Asp Gly Asn 
1745 1750 1755 1760 

gtg ctt ate aaa gec aaa gat ggt aag ttc tac aaa gca gac gac etc 5328 
Val Leu He Lys Ala Lys Asp Gly Lys Phe Tyr Lys Ala Asp Asp Leu 
1765 1770 1775 

atg cca aac ggc tea eta aaa gca ggc aaa tea gec agt gat gec aaa 53 76 
Met Pro Asn Gly Ser Leu Lys Ala Gly Lys Ser Ala Ser Asp Ala Lys 
1780 1785 1790 

act cca act ggt eta age ctt gtt aac ccc aat get ggt aaa ggc agt 5424 
Thr Pro Thr Gly Leu Ser Leu Val Asn Pro Asn Ala Gly Lys Gly Ser 
1795 1800 1805 

aca ggc gat gca gtg get ctt aat aac tta tea aaa gcg gta ttt aaa 5472 



Thr Gly Asp Ala Val Ala Leu Asn Asn Leu Ser Lys Ala Val Phe Lys 
1810 1815 1820 



tec aaa gat ggt aca act act acc aca gta age tct gat ggc ate agt 5520 
Ser Lys Asp Gly Thr Thr Thr Thr Thr Val Ser Ser Asp Gly He Ser 
1825 1830 1835 1840 

ate caa ggc aaa gat aac age age ate acc eta age aaa gat ggg ctg 5568 
He Gin Gly Lys Asp Asn Ser Ser He Thr Leu Ser Lys Asp Gly Leu 
1845 1850 1855 

aat gta ggc ggt aag gtc ate age aat gtg ggt aaa ggc aca aaa gac 5616 
Asn Val Gly Gly Lys Val He Ser Asn Val Gly Lys Gly Thr Lys Asp 
1860 1865 1870 

acc gac get gec aat gta caa cag tta aac gaa gta cgc aac ttg ttg 5664 
Thr Asp Ala Ala Asn Val Gin Gin Leu Asn Glu Val Arg Asn Leu Leu 
1875 1880 1885 

ggt ctt ggt aat get ggt aat gat aac get gac ggc aat cag gta aac 5712 
Gly Leu Gly Asn Ala Gly Asn Asp Asn Ala Asp Gly Asn Gin Val Asn 
1890 1895 1900 

att gee gac ate aaa aaa gac cca aat tea ggt tea tea tct aac cgc 5760 
He Ala Asp He Lys Lys Asp Pro Asn Ser Gly Ser Ser Ser Asn Arg 
1905 1910 1915 1920 

act gtc ate aaa gca ggc acg gta ctt ggc ggt aaa ggt aat aac gat 58 08 
Thr Val He Lys Ala Gly Thr Val Leu Gly Gly Lys Gly Asn Asn Asp 
1925 1930 1935 

acc gaa aaa ctt gee act ggt ggt gta caa gtg ggc gtg gat aaa gac 5856 
Thr Glu Lys Leu Ala Thr Gly Gly Val Gin Val Gly Val Asp Lys Asp 
1940 1945 1950 

ggc aac get aac ggc gat tta age aat gtt tgg gtc aaa acc caa aaa 5904 
Gly Asn Ala Asn Gly Asp Leu Ser Asn Val Trp Val Lys Thr Gin Lys 
1955 1960 1965 

gat ggc age aaa aaa gee ctg etc gec act tat aac gec gca ggt cag 5952 
Asp Gly Ser Lys Lys Ala Leu Leu Ala Thr Tyr Asn Ala Ala Gly Gin 
1970 1975 1980 

acc aac tat ttg acc aac aac ccc gca gaa gec att gac aga ata aat 60 00 
Thr Asn Tyr Leu Thr Asn Asn Pro Ala Glu Ala He Asp Arg He Asn 
1985 1990 1995 2000 

gaa caa ggt ate cgc ttc ttc cat gtc aac gat ggc aat caa gag cct 6048 
Glu Gin Gly He Arg Phe Phe His Val Asn Asp Gly Asn Gin Glu Pro 
2005 2010 2015 

gtg gta caa ggg cgt aac ggc att gac tea agt gec tea ggc aag cac 60 96 
Val Val Gin Gly Arg Asn Gly He Asp Ser Ser Ala Ser Gly Lys His 
2020 2025 2030 

tea gtg gcg ata ggt ttc cag gee aag gca gat ggt gaa gee gec gtt 6144 
Ser Val Ala He Gly Phe Gin Ala Lys Ala Asp Gly Glu Ala Ala Val 



2035 



2040 



2045 



gcc ata ggc aga caa acc caa gca ggc aac caa tec ate gec ate ggt 6192 
Ala lie Gly Arg Gin Thr Gin Ala Gly Asn Gin Ser lie Ala lie Gly 
2050 2055 2060 

gat aac gca caa gcc acg ggc gat caa tec ate gcc ate ggt aca ggc 6240 
Asp Asn Ala Gin Ala Thr Gly Asp Gin Ser lie Ala lie Gly Thr Gly 
2065 2070 2075 2080 

aat gtg gta aca ggt aag cac tct ggt gcc ate ggc gac cca age act 62 8 8 
Asn Val Val Thr Gly Lys His Ser Gly Ala lie Gly Asp Pro Ser Thr 
2085 2090 2095 

gtt aag get gat aac agt tac agt gtg ggt aat aac aac cag ttt ate 6336 
Val Lys Ala Asp Asn Ser Tyr Ser Val Gly Asn Asn Asn Gin Phe lie 
2100 2105 2110 

gat gcc act cag acc gat gtc ttt ggt gtg ggc aat aac ate acc gtg 63 84 
Asp Ala Thr Gin Thr Asp Val Phe Gly Val Gly Asn Asn He Thr Val 
2115 2120 2125 

acc gaa agt aac teg gtt gcc tta ggt tea aac tct gcc ate agt gca 6432 
Thr Glu Ser Asn Ser Val Ala Leu Gly Ser Asn Ser Ala He Ser Ala 
2130 2135 2140 

ggc aca cac gca ggc aca caa gcc aaa aaa tct gac ggc aca gca ggt 6480 
Gly Thr His Ala Gly Thr Gin Ala Lys Lys Ser Asp Gly Thr Ala Gly 
2145 2150 2155 2160 

aca acc acc aca gca ggt gca aca ggt acg gtt aaa ggc ttt get gga 652 8 
Thr Thr Thr Thr Ala Gly Ala Thr Gly Thr Val Lys Gly Phe Ala Gly 
2165 2170 2175 

caa acg gcg gtt ggt gcg gtc tec gtg ggt gcc tea ggt get gaa cgc 657 6 
Gin Thr Ala Val Gly Ala Val Ser Val Gly Ala Ser Gly Ala Glu Arg 
2180 2185 2190 

cgt ate caa aat gtg gca gca ggt gag gtc agt gcc acc age acc gat 6624 
Arg He Gin Asn Val Ala Ala Gly Glu Val Ser Ala Thr Ser Thr Asp 
2195 2200 2205 

gcg gtc aat ggt age cag ttg tac aaa gcc acc caa ggc att gcc aac 6672 
Ala Val Asn Gly Ser Gin Leu Tyr Lys Ala Thr Gin Gly He Ala Asn 
2210 2215 2220 

gca acc aat gag ctt gac cat cgt ate cac caa aac gaa aat aaa gcc 672 0 
Ala Thr Asn Glu Leu Asp His Arg He His Gin Asn Glu Asn Lys Ala 
2225 2230 2235 2240 

aat gca ggg att tea tea gcg atg gcg atg gcg tec atg cca caa gcc 6768 
Asn Ala Gly He Ser Ser Ala Met Ala Met Ala Ser Met Pro Gin Ala 
2245 2250 2255 

tac att cct ggc aga tec atg gtt acc ggg ggt att gcc acc cac aac 6816 
Tyr He Pro Gly Arg Ser Met Val Thr Gly Gly He Ala Thr His Asn 
2260 2265 2270 



ggt caa ggt gcg gtg gca gtg gga ctg teg aag ctg teg gat aat ggt 6864 
Gly Gin Gly Ala Val Ala Val Gly Leu Ser Lys Leu Ser Asp Asn Gly 
2275 2280 2285 

caa tgg gta ttt aaa ate aat ggt tea gee gat ace caa ggc cat gta 6912 
Gin Trp Val Phe Lys lie Asn Gly Ser Ala Asp Thr Gin Gly His Val 
2290 2295 2300 



ggg gcg gca gtt ggt gca ggt ttt cac ttt 6942 
Gly Ala Ala Val Gly Ala Gly Phe His Phe 
2305 2310 
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Figure 8. M. catarrhalis M56 200kDa gene in pKS348. 



ATG ate ggt gca acg etc agt ggc agt get tat get caa aaa aaa gat 4 8 
Met lie Gly Ala Thr Leu Ser Gly Ser Ala Tyr Ala Gin Lys Lys Asp 
15 10 15 

acc aaa cat ate gca att ggt gaa caa aac cag cca aga cgc tea ggc 96 
Thr Lys His lie Ala lie Gly Glu Gin Asn Gin Pro Arg Arg Ser Gly 
20 25 30 

act gec aag gcg gac ggt gat cga gee att get att ggt gaa aat get 144 
Thr Ala Lys Ala Asp Gly Asp Arg Ala lie Ala lie Gly Glu Asn Ala 
35 40 45 

aac gca cag ggc ggt caa gee ate gee ate ggt agt agt aat aaa act 192 
Asn Ala Gin Gly Gly Gin Ala lie Ala lie Gly Ser Ser Asn Lys Thr 
50 55 60 

gtc aat gga age agt ttg gat aag ata ggt acc gat get acg ggt caa 24 0 
Val Asn Gly Ser Ser Leu Asp Lys lie Gly Thr Asp Ala Thr Gly Gin 
65 70 75 80 

gag tec ate gec ate ggt ggt gat gta aag get agt ggt gat gee teg 2 88 
Glu Ser lie Ala lie Gly Gly Asp Val Lys Ala Ser Gly Asp Ala Ser 
85 90 95 

att gee ate ggt agt gat gac tta cat ttg ctt gat cag cat ggt aat 336 
lie Ala lie Gly Ser Asp Asp Leu His Leu Leu Asp Gin His Gly Asn 
100 105 110 

cct aaa cat ccg aaa ggt act ctg att aac gat ctt att aac ggc cat 384 
Pro Lys His Pro Lys Gly Thr Leu lie Asn Asp Leu lie Asn Gly His 
115 120 125 

gca gta tta aaa gaa ata cga age tea aag gat aat gat gta aaa tat 432 
Ala Val Leu Lys Glu lie Arg Ser Ser Lys Asp Asn Asp Val Lys Tyr 
130 135 140 

aga cgc aca acc gca age gga cac gec agt act gca gtg gga gec atg 48 0 
Arg Arg Thr Thr Ala Ser Gly His Ala Ser Thr Ala Val Gly Ala Met 
145 150 155 160 

tea tat gca cag ggt cat ttt tec aac gee ttt ggt aca egg gca aca 52 8 
Ser Tyr Ala Gin Gly His Phe Ser Asn Ala Phe Gly Thr Arg Ala Thr 
165 170 175 

get aaa agt gec tat tec ttg gca gtg ggt ctt gee gee aca gee gag 576 
Ala Lys Ser Ala Tyr Ser Leu Ala Val Gly Leu Ala Ala Thr Ala Glu 
180 185 190 

ggc caa tct aca ate get att ggt tct gat gca aca tct age teg ttg 624 
Gly Gin Ser Thr lie Ala lie Gly Ser Asp Ala Thr Ser Ser Ser Leu 
195 200 205 

gga gcg ata gee ctt ggt gca ggt act cgt get cag eta cag ggc agt 672 
Gly Ala lie Ala Leu Gly Ala Gly Thr Arg Ala Gin Leu Gin Gly Ser 



210 



215 



220 



att gcc eta ggt caa ggt tct gtt gtc act cag agt gat aat aat tct 72 0 
lie Ala Leu Gly Gin Gly Ser Val Val Thr Gin Ser Asp Asn Asn Ser 
225 230 235 240 

aga ccg gcc tat aca cca aat acc cag gca eta gac ccc aag ttt caa 768 
Arg Pro Ala Tyr Thr Pro Asn Thr Gin Ala Leu Asp Pro Lys Phe Gin 
245 250 255 

gcc acc aat aat acg aag gcg ggt cca ctt tec att ggt agt aac tct 816 
Ala Thr Asn Asn Thr Lys Ala Gly Pro Leu Ser lie Gly Ser Asn Ser 
260 265 270 

ate aaa cgt aaa ate ate aat gtc ggt gca ggt gtt aat aaa acc gat 864 
lie Lys Arg Lys lie lie Asn Val Gly Ala Gly Val Asn Lys Thr Asp 
275 280 285 

gcg gtc aat gtg gca cag eta gaa gcg gtg gtg aag tgg get aag gag 912 
Ala Val Asn Val Ala Gin Leu Glu Ala Val Val Lys Trp Ala Lys Glu 
290 295 300 

cgt aga att act ttt cag ggt gat gat aac agt act gac gta aaa ata 960 
Arg Arg lie Thr Phe Gin Gly Asp Asp Asn Ser Thr Asp Val Lys lie 
305 310 315 320 

ggt ttg gat aat act tta act att aaa ggt ggt gca gag acc aac gca 1008 
Gly Leu Asp Asn Thr Leu Thr lie Lys Gly Gly Ala Glu Thr Asn Ala 
325 330 335 

tta acc gat aat aat ate ggt gtg gta aaa gag get gat aat agt ggt 105 6 
Leu Thr Asp Asn Asn lie Gly Val Val Lys Glu Ala Asp Asn Ser Gly 
340 345 350 

ctg aaa gtt aaa ctt get aaa act tta aac aat ctt act gag gtg aat 1104 
Leu Lys Val Lys Leu Ala Lys Thr Leu Asn Asn Leu Thr Glu Val Asn 
355 360 365 

aca act aca tta aat gcc aca acc aca gtt aag gta ggt agt agt agt 1152 
Thr Thr Thr Leu Asn Ala Thr Thr Thr Val Lys Val Gly Ser Ser Ser 
370 375 380 

agt act aca get gaa tta ttg agt gat agt tta acc ttt acc cag ccc 1200 
Ser Thr Thr Ala Glu Leu Leu Ser Asp Ser Leu Thr Phe Thr Gin Pro 
385 390 395 400 

aat aca ggc agt caa age aca age aaa acc gtc tat ggc gtt aat ggg 1248 
Asn Thr Gly Ser Gin Ser Thr Ser Lys Thr Val Tyr Gly Val Asn Gly 
405 410 415 

gtg aag ttt act aat aat gca gaa aca aca gca gca ate ggc act act 1296 
Val Lys Phe Thr Asn Asn Ala Glu Thr Thr Ala Ala lie Gly Thr Thr 
420 425 430 

cgt att acc aga gat aaa att ggc ttt get cga gat ggt gat gtt gat 1344 
Arg lie Thr Arg Asp Lys lie Gly Phe Ala Arg Asp Gly Asp Val Asp 
435 440 445 



gaa aaa caa gca cca tat ttg gat aaa aaa caa ctt aaa gtg ggt agt 13 92 
Glu Lys Gin Ala Pro Tyr Leu Asp Lys Lys Gin Leu Lys Val Gly Ser 
450 455 460 

gtt gca att acc ata gac aat ggc att gat gca ggt aat aaa aag ate 1440 
Val Ala lie Thr lie Asp Asn Gly lie Asp Ala Gly Asn Lys Lys lie 
465 470 475 480 

agt aat ctt gec aaa ggt age agt get aac gat gcg gtt acc ate gaa 1488 
Ser Asn Leu Ala Lys Gly Ser Ser Ala Asn Asp Ala Val Thr lie Glu 
485 490 495 

cag etc aaa gec gee aag cct act tta aac gca ggc get ggc ate agt 1536 
Gin Leu Lys Ala Ala Lys Pro Thr Leu Asn Ala Gly Ala Gly lie Ser 
500 505 510 

gtc aca cct act gaa ata tea gtt gat get aag agt ggc aat gtt acc 1584 
Val Thr Pro Thr Glu lie Ser Val Asp Ala Lys Ser Gly Asn Val Thr 
515 520 525 

gec cca act tac aac att ggc gtg aaa acc acc gag ctt aac agt gat 1632 
Ala Pro Thr Tyr Asn lie Gly Val Lys Thr Thr Glu Leu Asn Ser Asp 
530 535 540 

ggc act agt gat aaa ttt agt gtt aag ggt agt ggt acg aac aat age 1680 
Gly Thr Ser Asp Lys Phe Ser Val Lys Gly Ser Gly Thr Asn Asn Ser 
545 550 555 560 

tta gtt acc gee gaa cat ttg gca age tat eta aat gaa gtc aat cga 172 8 
Leu Val Thr Ala Glu His Leu Ala Ser Tyr Leu Asn Glu Val Asn Arg 
565 570 575 

acg get gac agt get eta caa age ttt acc gtt aaa gaa gaa gac gat 1776 
Thr Ala Asp Ser Ala Leu Gin Ser Phe Thr Val Lys Glu Glu Asp Asp 
580 585 590 

gat gac gee aac get ate acc gtg get aaa gat acg aca aaa aat gee 1824 
Asp Asp Ala Asn Ala lie Thr Val Ala Lys Asp Thr Thr Lys Asn Ala 
595 600 605 

ggc gca gtc age ate tta aaa etc aaa ggt aaa aac ggt eta acg gtt 1872 
Gly Ala Val Ser lie Leu Lys Leu Lys Gly Lys Asn Gly Leu Thr Val 
610 615 620 

get acc aaa aaa gat ggt acg gtt acc ttt ggg ctt age caa gat age 192 0 
Ala Thr Lys Lys Asp Gly Thr Val Thr Phe Gly Leu Ser Gin Asp Ser 
625 630 635 640 

ggt ctg acc att ggc aaa age acc eta aac aac gat ggc ttg act gtt 1968 
Gly Leu Thr lie Gly Lys Ser Thr Leu Asn Asn Asp Gly Leu Thr Val 
645 650 655 

aaa gat acc aac gaa caa ate caa gtc ggt get aat ggc att aaa ttt 2 016 
Lys Asp Thr Asn Glu Gin lie Gin Val Gly Ala Asn Gly lie Lys Phe 
660 665 670 



act aat gtg aat ggt agt aat cca ggt act ggc att gca aat acc get 2 064 
Thr Asn Val Asn Gly Ser Asn Pro Gly Thr Gly lie Ala Asn Thr Ala 
675 680 685 

cgc att acc aga gat aaa att ggc ttt get ggt tct gat ggt gca gtt 2112 
Arg lie Thr Arg Asp Lys lie Gly Phe Ala Gly Ser Asp Gly Ala Val 
690 695 700 

gat aca aac aaa cct tat ctt gat caa gac aag eta caa gtt ggc aat 2160 
Asp Thr Asn Lys Pro Tyr Leu Asp Gin Asp Lys Leu Gin Val Gly Asn 
705 710 715 720 

gtt aag att acc aac act ggc att aac gca ggt ggt aaa gec ate aca 2208 
Val Lys He Thr Asn Thr Gly He Asn Ala Gly Gly Lys Ala He Thr 
725 730 735 

999 ct 9 tec cca aca ctg cct age att gee gat caa agt age cgc aac 2256 
Gly Leu Ser Pro Thr Leu Pro Ser He Ala Asp Gin Ser Ser Arg Asn 
740 745 750 

ata gaa ctg ggc aat aca ate caa gac aaa gac aaa tec aac get gee 23 04 
He Glu Leu Gly Asn Thr He Gin Asp Lys Asp Lys Ser Asn Ala Ala 
755 760 765 

age att aat gat ata tta aat aca ggc ttt aac eta aaa aat aat aac 2352 
Ser He Asn Asp He Leu Asn Thr Gly Phe Asn Leu Lys Asn Asn Asn 
770 775 780 

aac ccc att gac ttt gtc tec act tat gac att gtt gac ttt gee aat 2400 
Asn Pro He Asp Phe Val Ser Thr Tyr Asp He Val Asp Phe Ala Asn 
785 790 795 800 

ggc aat gee acc acc gee aca gta acc cat gat acc get aac aaa acc 2448 
Gly Asn Ala Thr Thr Ala Thr Val Thr His Asp Thr Ala Asn Lys Thr 
805 810 815 

agt aaa gtg gta tat gat gtg aat gtg gat gat aca acc att cat eta 2496 
Ser Lys Val Val Tyr Asp Val Asn Val Asp Asp Thr Thr He His Leu 
820 825 830 

aca ggc act gat gac aat aaa aaa ctt ggc gtc aaa acc acc aaa ctg 2544 
Thr Gly Thr Asp Asp Asn Lys Lys Leu Gly Val Lys Thr Thr Lys Leu 
835 840 845 

aac aaa aca agt get aat ggt aat aca gca act aac ttt aat gtt aac 2592 
Asn Lys Thr Ser Ala Asn Gly Asn Thr Ala Thr Asn Phe Asn Val Asn 
850 855 860 

tct agt gat gaa gat gec ctt gtt aac gec aaa gac ate gee gaa aat 2640 
Ser Ser Asp Glu Asp Ala Leu Val Asn Ala Lys Asp He Ala Glu Asn 
865 870 875 880 

eta aac acc eta gee aag gaa att cac acc acc aaa ggc aca gca gac 2688 
Leu Asn Thr Leu Ala Lys Glu He His Thr Thr Lys Gly Thr Ala Asp 
885 890 895 

acc gee eta caa acc ttt acc gtt aaa aag gta gat gaa aat aat aat 2736 



Thr Ala Leu Gin Thr Phe Thr Val Lys Lys Val Asp Glu Asn Asn Asn 
900 905 910 



get gat gac gec aac gec ate acc gtg ggt caa aag aac gca aat aat 2784 
Ala Asp Asp Ala Asn Ala lie Thr Val Gly Gin Lys Asn Ala Asn Asn 
915 920 925 

caa gtc aac acc eta aca etc aaa ggt gaa aac ggt ctt aat att aaa 2 83 2 
Gin Val Asn Thr Leu Thr Leu Lys Gly Glu Asn Gly Leu Asn lie Lys 
930 935 940 

acc gac aaa aat ggt acg gtt acc ttt ggc att aac acc aca age ggt 2880 
Thr Asp Lys Asn Gly Thr Val Thr Phe Gly lie Asn Thr Thr Ser Gly 
945 950 955 960 

ctt aaa gec ggc aaa age acc eta aac gac ggt ggc ttg tct att aaa 2 92 8 
Leu Lys Ala Gly Lys Ser Thr Leu Asn Asp Gly Gly Leu Ser lie Lys 
965 970 975 

aac ccc act ggt age gaa caa ate caa gtc ggt get gat ggc gtg aag 2 976 
Asn Pro Thr Gly Ser Glu Gin lie Gin Val Gly Ala Asp Gly Val Lys 
980 985 990 

ttt gec aag gtt aat aat aat ggt gtt gta ggt get ggc att gat ggc 3024 
Phe Ala Lys Val Asn Asn Asn Gly Val Val Gly Ala Gly lie Asp Gly 
995 1000 1005 

aca act cgc att acc aga gat gaa att ggc ttt act ggg act aat ggc 3072 
Thr Thr Arg lie Thr Arg Asp Glu lie Gly Phe Thr Gly Thr Asn Gly 
1010 1015 1020 

tea ctt gat aaa age aaa ccc cac eta age aaa gac ggc att aac gca 312 0 
Ser Leu Asp Lys Ser Lys Pro His Leu Ser Lys Asp Gly lie Asn Ala 
1025 1030 1035 1040 

SfSft 99 fc aaa aa 9 att acc aac att caa tca SJSt 9 a 9 att 9 CC caa aac 3168 
Gly Gly Lys Lys lie Thr Asn lie Gin Ser Gly Glu lie Ala Gin Asn 
1045 1050 1055 

age cat gat get gtg aca ggc ggc aag att tat gat tta aaa acc gaa 3216 
Ser His Asp Ala Val Thr Gly Gly Lys lie Tyr Asp Leu Lys Thr Glu 
1060 1065 1070 

ctt gaa aac aaa ate age agt act gee aaa aca gca caa aac tca tta 3 2 64 
Leu Glu Asn Lys lie Ser Ser Thr Ala Lys Thr Ala Gin Asn Ser Leu 
1075 1080 1085 

cac gaa ttc tca gta gca gat gaa caa ggt aat aac ttt acg gtt agt 3312 
His Glu Phe Ser Val Ala Asp Glu Gin Gly Asn Asn Phe Thr Val Ser 
1090 1095 1100 

aac cct tac tec agt tat gac acc tca aag acc tct gat gtc ate acc 3360 
Asn Pro Tyr Ser Ser Tyr Asp Thr Ser Lys Thr Ser Asp Val lie Thr 
1105 1110 1115 1120 

ttt gca ggt gaa aac ggc att acc acc aag gta aat aaa ggt gtg gtg 340 8 
Phe Ala Gly Glu Asn Gly He Thr Thr Lys Val Asn Lys Gly Val Val 



1125 



1130 



1135 



cgt gtg ggc att gac caa acc aaa ggc tta acc acg cct aag ctg acc 3456 
Arg Val Gly lie Asp Gin Thr Lys Gly Leu Thr Thr Pro Lys Leu Thr 
1140 1145 1150 

gtg ggt aat aat aat ggc aaa ggc att gtc att gac age caa aat ggt 3504 
Val Gly Asn Asn Asn Gly Lys Gly lie Val lie Asp Ser Gin Asn Gly 
1155 1160 1165 

caa aat acc ate aca gga eta age aac act eta get aat gtt acc aat 3552 
Gin Asn Thr lie Thr Gly Leu Ser Asn Thr Leu Ala Asn Val Thr Asn 
1170 1175 1180 

gat aaa ggt age gta cgc acc aca gaa cag ggc aat ata ate aaa gac 3600 
Asp Lys Gly Ser Val Arg Thr Thr Glu Gin Gly Asn lie lie Lys Asp 
1185 1190 1195 1200 

gaa gac aaa acc cgt gee gee age att gtt gat gtg eta age gca ggc 3 64 8 
Glu Asp Lys Thr Arg Ala Ala Ser lie Val Asp Val Leu Ser Ala Gly 
1205 1210 1215 

ttt aac ttg caa ggc aat ggt gaa gcg gtt gac ttt gtc tec act tat 3696 
Phe Asn Leu Gin Gly Asn Gly Glu Ala Val Asp Phe Val Ser Thr Tyr 
1220 1225 1230 

gac acc gtc aac ttt gee gat ggc aat gec acc acc get aag gtg acc 3744 
Asp Thr Val Asn Phe Ala Asp Gly Asn Ala Thr Thr Ala Lys Val Thr 
1235 1240 1245 

tat gat gac aca age aaa acc agt aaa gtg gtc tat gat gtc aat gtg 37 92 
Tyr Asp Asp Thr Ser Lys Thr Ser Lys Val Val Tyr Asp Val Asn Val 
1250 1255 1260 

gat gat aca acc att gaa gtt aaa gat aaa aaa ctt ggc gta aaa acc 3 84 0 
Asp Asp Thr Thr lie Glu Val Lys Asp Lys Lys Leu Gly Val Lys Thr 
1265 1270 1275 1280 

acc aca ttg acc agt act ggc aca ggt get aat aaa ttt gee eta age 3888 
Thr Thr Leu Thr Ser Thr Gly Thr Gly Ala Asn Lys Phe Ala Leu Ser 
1285 1290 1295 

aat caa get act ggc gat gcg ctt gtc aag gee agt gat ate gtt get 3 93 6 
Asn Gin Ala Thr Gly Asp Ala Leu Val Lys Ala Ser Asp lie Val Ala 
1300 1305 1310 

cat eta aac acc tta tct ggc gac ate caa act gee aaa ggg gca age 3984 
His Leu Asn Thr Leu Ser Gly Asp lie Gin Thr Ala Lys Gly Ala Ser 
1315 1320 1325 

caa gcg aac aac tea gca ggc tat gtg gat get gat ggc aat aag gtc 4032 
Gin Ala Asn Asn Ser Ala Gly Tyr Val Asp Ala Asp Gly Asn Lys Val 
1330 1335 1340 

ate tat gac agt acc gat aac aag tac tat caa gec aaa aat gat ggc 4 0 80 
lie Tyr Asp Ser Thr Asp Asn Lys Tyr Tyr Gin Ala Lys Asn Asp Gly 
1345 1350 1355 1360 



aca gtt gat aaa acc aaa gaa gtt gcc aaa gac aaa ctg gtc gcc caa 4128 
Thr Val Asp Lys Thr Lys Glu Val Ala Lys Asp Lys Leu Val Ala Gin 
1365 1370 1375 

gcc caa acc cca gat ggc aca ttg get caa atg aat gtc aaa tea gtc 4176 
Ala Gin Thr Pro Asp Gly Thr Leu Ala Gin Met Asn Val Lys Ser Val 
1380 1385 1390 

att aac aaa gaa caa gta aat gat gcc aat aaa aag caa ggc ate aat 4224 
lie Asn Lys Glu Gin Val Asn Asp Ala Asn Lys Lys Gin Gly lie Asn 
1395 1400 1405 

gaa gac aac gcc ttt gtt aaa gga ctt gaa aaa gcc get tct gat aac 4272 
Glu Asp Asn Ala Phe Val Lys Gly Leu Glu Lys Ala Ala Ser Asp Asn 
1410 1415 1420 

aaa acc aaa aac gcc gca gta act gtg ggt gat tta aat gcc gtt gcc 4320 
Lys Thr Lys Asn Ala Ala Val Thr Val Gly Asp Leu Asn Ala Val Ala 
1425 1430 1435 1440 

caa aca ccg ctg acc ttt gca ggg gat aca ggc aca acg get aaa aaa 4368 
Gin Thr Pro Leu Thr Phe Ala Gly Asp Thr Gly Thr Thr Ala Lys Lys 
1445 1450 1455 

ctg ggc gag act ttg acc ate aaa ggt ggg caa aca gac acc aat aag 4416 
Leu Gly Glu Thr Leu Thr lie Lys Gly Gly Gin Thr Asp Thr Asn Lys 
1460 1465 1470 

eta acc gat aat aac ate ggt gtg gta gca ggt act gat ggc ttc act 4464 
Leu Thr Asp Asn Asn He Gly Val Val Ala Gly Thr Asp Gly Phe Thr 
1475 1480 1485 

gtc aaa ctt gcc aaa gac eta acc aat ctt aac age gtt aat gca ggt 4512 
Val Lys Leu Ala Lys Asp Leu Thr Asn Leu Asn Ser Val Asn Ala Gly 
1490 1495 1500 

ggc acc aaa att gat gac aaa ggc gtg tct ttt gta gac tea age ggt 4560 
Gly Thr Lys He Asp Asp Lys Gly Val Ser Phe Val Asp Ser Ser Gly 
1505 1510 1515 1520 

caa gcc aaa gca aac acc cct gtg eta agt gcc aat ggg ctg gac ctg 4608 
Gin Ala Lys Ala Asn Thr Pro Val Leu Ser Ala Asn Gly Leu Asp Leu 
1525 1530 1535 

ggt ggc aag gtc ate agt aat gtg ggc aaa ggc aca aaa gat acc gac 4 656 
Gly Gly Lys Val He Ser Asn Val Gly Lys Gly Thr Lys Asp Thr Asp 
1540 1545 1550 

get gcc aat gta caa cag tta aac gaa gta cgc aac ttg ttg ggt ctt 47 04 
Ala Ala Asn Val Gin Gin Leu Asn Glu Val Arg Asn Leu Leu Gly Leu 
1555 1560 1565 

ggt aat get ggt aat gat aac get gac ggc aat cag gta aac att gcc 47 52 
Gly Asn Ala Gly Asn Asp Asn Ala Asp Gly Asn Gin Val Asn He Ala 
1570 1575 1580 



gac ate aaa aaa gac cca aat tea ggt tea tea tct aac cgc act gtc 4800 
Asp lie Lys Lys Asp Pro Asn Ser Gly Ser Ser Ser Asn Arg Thr Val 
1585 1590 1595 1600 

ate aaa gca ggc acg gta ctt ggc ggt aaa ggt aat aac gat acc gaa 4 84 8 
lie Lys Ala Gly Thr Val Leu Gly Gly Lys Gly Asn Asn Asp Thr Glu 
1605 1610 1615 

aaa ctt gee act ggt ggt ata caa gtg ggc gtg gat aaa gac ggc aac 4896 
Lys Leu Ala Thr Gly Gly lie Gin Val Gly Val Asp Lys Asp Gly Asn 
1620 1625 1630 

get aac ggc gat tta age aat gtt tgg gtc aaa acc caa aaa gat ggc 4944 
Ala Asn Gly Asp Leu Ser Asn Val Trp Val Lys Thr Gin Lys Asp Gly 
1635 1640 1645 

age aaa aaa gee ctg etc gec act tat aac gee gca ggt cag acc aac 4 992 
Ser Lys Lys Ala Leu Leu Ala Thr Tyr Asn Ala Ala Gly Gin Thr Asn 
1650 1655 1660 

tat ttg acc aac aac ccc gca gaa gee att gac aga ata aat gaa caa 5040 
Tyr Leu Thr Asn Asn Pro Ala Glu Ala lie Asp Arg lie Asn Glu Gin 
1665 1670 1675 1680 

ggt ate cgc ttc ttc cat gtc aac gat ggc aat caa gag cct gtg gta 5088 
Gly He Arg Phe Phe His Val Asn Asp Gly Asn Gin Glu Pro Val Val 
1685 1690 1695 

caa ggg cgt aac ggc att gac tea agt gee tea ggc aag cac tea gtg 513 6 
Gin Gly Arg Asn Gly He Asp Ser Ser Ala Ser Gly Lys His Ser Val 
1700 1705 1710 

gcg ata ggt ttc cag gee aag gca gat ggt gaa gec gee gtt gec ata 5184 
Ala He Gly Phe Gin Ala Lys Ala Asp Gly Glu Ala Ala Val Ala He 
1715 1720 1725 

ggc aga caa acc caa gca ggc aac caa tec ate gec ate ggt gat aac 52 32 
Gly Arg Gin Thr Gin Ala Gly Asn Gin Ser He Ala He Gly Asp Asn 
1730 1735 1740 

gca caa gec acg ggc gat caa tec ate gee ate ggt aca ggc aat gtg 52 8 0 
Ala Gin Ala Thr Gly Asp Gin Ser He Ala He Gly Thr Gly Asn Val 
1745 1750 1755 1760 

gta gca ggt aag cac tct ggt gee ate ggc gac cca age act gtt aag 532 8 
Val Ala Gly Lys His Ser Gly Ala He Gly Asp Pro Ser Thr Val Lys 
1765 1770 1775 

get gat aac agt tac agt gtg ggt aat aac aac cag ttt acc gat gec 5376 
Ala Asp Asn Ser Tyr Ser Val Gly Asn Asn Asn Gin Phe Thr Asp Ala 
1780 1785 1790 

act caa acc gat gtc ttt ggt gtg ggc aat aac ate acc gtg acc gaa 5424 
Thr Gin Thr Asp Val Phe Gly Val Gly Asn Asn He Thr Val Thr Glu 
1795 1800 1805 

agt aac teg gtt gee tta ggt tea aac tct gee ate agt gca ggc aca 5472 



Ser Asn Ser Val Ala Leu Gly Ser Asn Ser Ala lie Ser Ala Gly Thr 
1810 1815 1820 



cac gca ggc aca caa gcc aaa aaa tct gac ggc aca gca ggt aca acc 5520 
His Ala Gly Thr Gin Ala Lys Lys Ser Asp Gly Thr Ala Gly Thr Thr 
1825 1830 1835 1840 

acc aca gca ggt gca acc ggt acg gtt aaa ggc ttt get gga caa acg 5568 
Thr Thr Ala Gly Ala Thr Gly Thr Val Lys Gly Phe Ala Gly Gin Thr 
1845 1850 1855 

gcg gtt ggt gcg gtc tec gtg ggt gcc tea ggt get gaa cgc cgt ate 5616 
Ala Val Gly Ala Val Ser Val Gly Ala Ser Gly Ala Glu Arg Arg lie 
1860 1865 1870 

caa aat gtg gca gca ggt gag gtc agt gcc acc age acc gat gcg gtc 5664 
Gin Asn Val Ala Ala Gly Glu Val Ser Ala Thr Ser Thr Asp Ala Val 
1875 1880 1885 

aat ggt age cag ttg tac aaa gcc acc caa age att gcc aac gca acc 5712 
Asn Gly Ser Gin Leu Tyr Lys Ala Thr Gin Ser He Ala Asn Ala Thr 
1890 1895 1900 

aat gag ctt gac cat cgt ate cac caa aac gaa aat aag gcc aat gca 5760 
Asn Glu Leu Asp His Arg He His Gin Asn Glu Asn Lys Ala Asn Ala 
1905 1910 1915 1920 

ggg att tea tea gcg atg gcg atg gcg tec atg cca caa gcc tac att 5808 
Gly He Ser Ser Ala Met Ala Met Ala Ser Met Pro Gin Ala Tyr He 
1925 1930 1935 

cct ggc aga tec atg gtt acc ggg ggt att gcc acc cac aac ggt caa 5856 
Pro Gly Arg Ser Met Val Thr Gly Gly He Ala Thr His Asn Gly Gin 
1940 1945 1950 

ggt gcg gtg gca gtg gga ctg teg aag ctg teg gat aat ggt caa tgg 5 904 
Gly Ala Val Ala Val Gly Leu Ser Lys Leu Ser Asp Asn Gly Gin Trp 
1955 1960 1965 

gta ttt aaa ate aat ggt tea gcc gat acc caa ggc cat gta ggg gcg 5952 
Val Phe Lys He Asn Gly Ser Ala Asp Thr Gin Gly His Val Gly Ala 
1970 1975 1980 

gca gtt ggt gca ggt ttt cac ttt taagccataa ategcaagat tttacttaaa 6006 
Ala Val Gly Ala Gly Phe His Phe 
1985 1990 

aatcaatctc accatagttg tataaaacag catcagcatc agtcatatta ctgatgctga 6066 
tgttttttat cacttaaacc attttacege tcaagtgatt ctctttcacc atgaccaaat 6126 
cgecattgat cataggtaaa cttattgagt aaattttatc aatgtagttg ttagatatgg 6186 
ttaaaattgt gccattgacc aaaaaatgac cgatttatcc cgaaaatttc tgattatgat 6246 
ccgttgacct gca 6259 



Figure 9 A Construction of pKS294 
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Figure 9B Construction of pKS294 
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Figure 10. Construction of pKS348 
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FIGURE 11 



Purification of r200 kDa Protein from E. coli 

E. coli Whole Cell 

Tris/NaCI, pH 8.0 
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FIGURE 12 

Expression of M56 r200 kDa Protein Gene in E. coli 




a 2h 3h 4h 6.5h 2h 3h 4h 6h b 



A B 

A: KS358 induced when O.CM600 nm was 0.26 
B: KS358 induced when O.D. at 600 nm was 0.44 

a: strain 4223 lysate 

b: KS358 cultured overnight 



FIGURE 13 
Purification of M56 r200 kDa Protein (4223) 




1 2 3 4 5 6 7 



1. E. coli Whole cells 

2. Soluble proteins after 50 mM Tris/ NaCI, pH 8, extraction 

3. Soluble proteins after Tris/ Triton X-1 00/ EDTA extraction 

4. Soluble proteins after Tris/ OG extraction 

5. Pellet after Tris/ OG extraction 
6-7. Purified 200 kDa protein 



FIGURE 14 



Anti-M56 r200 kDa Antibody Titers in Mice 
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FIGURE 15 

Anti-M56 r200 kDa Antibody Titers in Guinea Pigs 
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Figure 17 

M. catarrhalis strain 4223 200 kDa 

CCATGGATATGGGCAGGTGTGCTCGCCTGCCGTATGATGGCGATGACACCCCATTTGCCC 
10 20 30 40 50 60 



CATATCTGTACGATTTGACATGTGATATGATTTAACATGTGACATGATTTAACATTGTTT 
70 80 90 100 U0 120 



AATACTGTTGCCATCATTACCATAATTTAGTAACGCATTTAGTAACGCATTTGTAAAAAT 
130 140 150 160 170 180 



CATTGCGCCCCTTTATGTGTATCATATGAATAGAATATTATGATTGTATCTGATTATTGT 
190 200 210 220 230 240 



ATCAGAATGGTGATGCTATATGATGATGCCTACGAGTTGATTTGGGTTAATCACTCTATG 
250 260 270 280 290 300 



ATTTGATATATTTTGAAACTAATCTATTGACTTAAATCACCATATGGTTATAATTTAGCA 
310 320 330 - 340 350 360 



TAATGGTAGGCTTTTTGTAAAAATCACATCGCAATATTGTTCTACTGTTACTACCATGCT 
370 380 390 400 410 420 



TGAATGACGATCCCAATCACCAGATTCATTCAAGTGATGTGTTTGTATACGCACCATTTA 
430 440 450 460 470 480 



CCCTAATTATTTCAATCAAATGCCTATGTCAGCATGJTATCATTTTTTTAAGGTAAACCAC 
490 500 510 " 520 530 540 

I I 
NET ASN HIS HE T£R L2£S VRL HE EHE ASN LffS AIA 12 TtR 02T TEE. H3B WET HA VAL* MA 
CATGAATCACATCTATAAAGTCATCTTTAACAAAGCCACAGGCACATTTATGGCAGTGGC 
550 560 570 580 590 600 

I 

GLD TYR AIA LXS SER HIS SER THR GLY GLY GLY SER C¥5 AIA H3R GLY GLN VAL OK 39 SER 
AGAGTACGCCAAATCCCACAGCACGGGGGGGGGTAGCTGTGCTACAGGGCAAGTTGGCAG 

610 620 630 640 650 660 

I 

VAL C¥S H51 LEU SER ffiE ALA ARG ILE AIA AIA LEU AIA VAL LEU VAL 56 ILE GLY AIA THR 
TGTATGCACTCTGAGCTTTGCCCGTATTGCCGCGCTCGCTGTCCTCGTGATCGGTGCAAC 

670 680 690 700 710 720 



Figure 19 Construction of pQ WE 
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Date 


Residpnfip 

1 \COIUCI IOC 




Willowdale, Ontario, Canada. 




Citizenship 




Canadian 




Post Office Address 




Apt. 709, 120 Torresdale Avenue, Willowdale, Ontario, Canada, M2R 3N7. 









Full name of fourth inventor, if any 
Michel H. Klein 




Fourth inventor's signature 


Date 




Residence 

Willowdale, Ontario, Canada. 




Citizenship 
Canadian 


a 


Post Office Address " * 

16 Mimro Boulevard, Willowdale, Ontario, Canada, M2P 1B9. 







-sc. 

^ Full name of fifth inventor, if any 
^ Fifth inventor's signature 

Residence 
I| Citizenship 



Post Office Address 



Full name of sixth inventor, if any 

Sixth inventor's signature 

Residence 

Citizenship 

Post Office Address 



Date 



Date 
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